Introduction to the 

Theory of Coherence and 
Polarization of Light 



EMIL WOLF 





Introduction to the Theory of Coherence 
and Polarization of Light 

All optical fields undergo random fluctuations. They may be small, a.s in the 
output of many lasers, or they may be appreciably larger as in light generated 
by thermal sources. The underlying theory of fluctuating optical fields is 
known as coherence theory. An important manifestation of the fluctuations is 
the phenomenon of partial polarization. Actually, coherence theory deals with 
appreciably more than with fluctuations. Unlike usual treatments it describes 
optical fields in terms of observable quantities and elucidates how such quan¬ 
tities, for example, the spectrum of light, change as light propagates. 

This book is the first to provide a unified treatment of the phenomena of 
coherence and polarization. The unification has been made possible by very 
recent discoveries, largely due to the author of this book. 

The subjects treated in this volume are of considerable importance for 
graduate students and for research workers in physics and in engineering 
who are concerned with optical communications, with propagation of laser 
beams through fibers and through the turbulent atmosphere, with optical 
image formation, particularly in microscopes, and with medical diagnostics, 
for example. Each chapter contains problems to aid self-study. 

Emil Wolf is Wilson Professor of Optical Physics at the University of 
Rochester, and is renowned for his work in physical optics. He has received 
many awards, including the Ives Medal of the Optical Society of America, 
the Albert A. Michelson Medal of the Franklin Institute and the Marconi 
Medal of the Italian Research Council. He is the recipient of se\en honorary 
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the well-known text Principles of Optics (with Max Born. se\enth edition. 
Cambridge University Press, 1999) and Optical Coherence and Quantum 
Optics (with Leonard Mandel. Cambridge University Press, 1995). He has 
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Preface 


. . the image that will be formed in a photo¬ 
graphic camera - i.e. the distribution of intensity 
on the sensitive layer - is present in an invisible, 
mysterious way in the aperture of the lens, where 
the intensity is equal at all points.” 

F. Zemike, discussing coherence in 
a lecture published in Proc. Phys. 

Soc. (London), 61 (1948), 158. 


The optical coherence and polarization phenomena with which this book is concerned have 
their origin in the fact that all optical fields, whether encountered in nature or generated in 
a laboratory, have some random fluctuations associated with them. The monochromatic 
sources and monochromatic fields discussed in most optics textbooks are not encountered 
in real life. 

For thermal light, the fluctuations are mainly due to spontaneous emission of radiation 
from atoms which generate the field. For laser light the fluctuations are due to uncontrol¬ 
lable causes such as mechanical vibrations of the mirrors at the end of the cavity, tempera¬ 
ture fluctuations and, again, arise also because contributions from spontaneous emission 
are always present. For a well-stabilized laser these effects are largely manifested in phase 
fluctuations rather than in amplitude fluctuations and also in the chaotic behavior of the 
laser output that may be detected over a sufficiently long time. In the optical region of the 
electromagnetic spectrum, the field fluctuations are too rapid to be directh measurable. 
The theory of coherence and polarization involves average quantities that can be measured. 
Consequently, the theory deals with observable quantities. 

When ^started writing this book my intention was to provide an introductory text on the 
subject of classical optical coherence theory alone. Although there are now several books and 
book chapters devoted to coherence, none of them seems to me to treat the subject at a level 
appropriate for a reader who has a reasonable knowledge of elementary optics and none 
presents the basic concepts and results of coherence theory in a sufficiently sound and yet 


xi 
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not too abstract manner. By the time I had written several chapters, a new development had 
taken place, namely the discovery that coherence and polarization of light are two aspects 
of statistical optics which are intimately related and can be treated in a unified manner. 
Until then coherence and polarization had been considered as being essentially independ¬ 
ent of each other - the only apparent link was the term “coherency matrix." a 2 X 2 correla¬ 
tion matrix which has been used in the analysis of elementary polarization problems since 
the 1930s. This term is actually a misnomer, because such a matrix has nothing to do with 
coherence, as the term is now understood. Coherence is essentially a consequence of correla¬ 
tions between some components of the fluctuating electric field at two for more) points and 
is manifested by the sharpness of fringes in Young's interference experiment. Polarization, 
on the other hand, is a manifestation of correlations involving components of the fluctuat¬ 
ing electric field at a single point and may be determined with the help of polarizers, rota¬ 
tors and phase plates. Both concepts reflect “degrees of order” in an electromagnetic field, 
but they pertain to somewhat different statistical aspects of it. The theories of coherence 
and polarization are, however, concerned not only with order and disorder in optical fields. 
Their basic tools are correlation functions and correlation matrices which, unlike some 
directly measurable quantities such as the spectrum of light, obey precise propagation laws. 
With the help of these laws one may determine, for example, spectral and polarization 
changes as light propagates, whether in free space or in a medium, which may be either 
deterministic (e.g. a glass fiber) or random fe.g. the turbulent atmosphere). Consequences of 
these laws are among the most useful aspects of the theory. 

Until very recently, coherence phenomena have been studied largely on the basis of scalar 
theory, whereas polarization requires a vector treatment. It was actually a generalization of 
the concept of coherence from scalar fields to electromagnetic vector fields, introduced only 
a few years ago, that has made it evident that coherence and polarization of light, whilst 
distinct phenomena, are just two closely related aspects of statistical optics: and that many 
features of fluctuating electromagnetic fields can be fully understood only when they are 
treated in close partnership. 1 This discovery' has not only enriched both subjects, but also has 
already provided new insights into many aspects of statistical optics. This development, which 
is discussed in the concluding chapter, is likely to find useful applications, for example, in 
connection with optical communication, with imaging by laser radar and in medical diag¬ 
nostics, but undoubtedly other applications will be forthcoming. 

In order to provide a treatment of the subject which is not too demanding mathematically 
and which will help the reader to acquire a working knowledge of it. detailed proofs are 
sometimes omitted. Most of them can be found in M. Bom and E. Wolf, Principles of Optics 
(Cambridge University Press, Cambridge, 7th (expanded) edition. 1999) and in L. Mandel 
and E. Wolf, Optical Coherence and Quantum Optics (Cambridge University Press, 


Developments leading to the recognition that there exists an intimate relationship between the phenomena of 
coherence and polarization are discussed in an article by E. Wolf. "Young's interference experiment and its 
influence on the development of statistical optics" in volume 50 of Progress in Optics (Amsterdam. Elsevier, 
2007). 
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Cambridge, 1995), 1 which also contain more detailed accounts of the subject, with full ref¬ 
erences. The historical development of coherence theory is outlined in an article in Selected 
Works of Emil Wolf with Commentary (World Scientific. Singapore, 2001), pp. 620-633. 
Accounts of the development of the theory' of polarization can be found in E. Collett. Polarized 
Light (M. Dekker, New York, 1993) and in C. Brosseau. Fundamentals of Polarized Light 
(J. Wiley, New York, 1998). 

Some readers may note that parts of the presentation resemble fairly closely the treat¬ 
ments given in B&W' and M&W. This is mainly due to the fact that I had difficulties in pro¬ 
viding a different formulation, but it should be clear that this book uses more elementary 
and less rigorous analysis, aimed at non-specialists, especially teachers and students, who 
might perhaps also find it helpful that problems are included at the end of each chapter. 
Additional problems can be found in M&W. 

I am grateful to Dr. Gale Gant and Dr. Don Nicolson for the photograph of the 20-foot 
Michelson stellar interferometer at Mount Wilson Observatory, built in the 1920s. The pho¬ 
tograph, reproduced as Fig. 3.12, was taken around the year 2000. 

In writing this book I have greatly benefited from the assistance of many colleagues and 
students who read drafts of the manuscript and helped in weeding out errors and improving 
the text. I would particularly like to acknowledge substantial advice that I received from 
Professor Taco Visser and also useful suggestions from Prof. Jannick Rolland, Mrs. Nicole 
Carlson-Moore. Dr. David Fischer, Dr. Olga Korotkova, Dr. Mircea Mujat, Mr. Jonathan 
Petruccelli, Mr. Mohamed Salem, Mr. Thijs Stegeman, Dr. Tomohiro Shirai, Mr. Mayukh 
Lahiri and Mr. Thomas van Dijk. I am also obliged to Mr. Mohamed Salem and Dr. Sergei 
Volkov for help with checking the proofs. 

The staff of the Physics-Optics-Astronomy Library of the University of Rochester pro¬ 
vided much help, especially with locating articles and checking references. I am much 
obliged to Mrs. Patricia Sulouff, the Head Librarian, and to Mrs. Sandra Cherin and Mrs. 
Miriam Margala for their assistance. 

I am greatly obliged to Dr. Greg Gbur for preparing the excellent line drawings of most 
of the figures and also for the beautiful figure which appears on the front cover. 

Some of the research described in this book, especially that connected with the unified 
theory of coherence and polarization discussed in Chapter 9, was supported by the Air 
Force Office of Scientific Research (AFOSR). 1 am much indebted to Dr. Arje Nachman of 
AFOSR for his continued support over many years and for his interest in our work. 

I acknowledge, with many thanks, the very substantial help provided by my secretary, 
Mrs. Ellen Calkins, who, without any complaints, typed and re-typed numerous versions of 
the manuscript and also prepared the author index. 

I wish to express my appreciation to my patient wife, Marlies, who spent long periods in 
solitude whilst I was preparing the manuscript. 

I presented much of the material contained in this book in courses to Physics and Optics 
graduate students at the University of Rochester and at the University of Central Florida; 

References to these books are abbreviated in the present work as B&W and M&W. respectively. 
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but a good part of the text is an expanded version of notes that I prepared for a course which 
I taught for many years at Annual Meetings of the Optical Society of America. I am 
indebted to Dr. Simon Capelin, the Publishing Director for Physical Sciences of Cambridge 
University Press, for suggesting that I expand the notes into a book and for encouraging me 
to do so. 

Finally I wish to express my appreciation to the staff of Cambridge University Press and, 
particularly, to J. Bottrill, the production editor, K. Howe, the production manager and to 
Dr. S. Holt who copy-edited the manuscript, for their cooperation and for having trans¬ 
formed an imperfect manuscript into a beautiful end product. 

Department of Physics and Astronomy Emil Wolf 

University of Rochester 
Rochester, NY 14627, USA 

Spring 2007 
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Elementary coherence phenomena 


1.1 Interference and statistical similarity 

The simplest manifestation of coherence between light vibrations at different points in an 
optical field is provided by the phenomenon of interference. In fact, as we will learn later, 
some features of the interference pattern provide a quantitative measure of the coherence 
between light vibrations at two points in space and at the two instants of time. 

Let us first consider light vibrations at a point P in an optical field. For the sake of sim¬ 
plicity we will ignore, to begin with, the polarization properties of light and we may then 
represent the light vibrations at a point in the field by a scalar, say U(t), If the light were 
monochromatic, it would be expressed as 

U(t) = a cos (<p — cot), (1) 

where a and cp are the (constant) amplitude and phase, respectively, co is the frequency and 
t denotes the time. However, as we have already noted, monochromatic light is an idealiza¬ 
tion which is never encountered in nature or in a laboratory. Light that in some respects 
imitates monochromatic light most closely is so-called quasi-monochromatic light. It is 
defined by the property that its effective bandv, idth. Aco is much smaller than its mean fre¬ 
quency aJ, i.e. that 


w 

For such light the amplitude and the phase are no longer constant and its vibrations at a 
point in space may be represented by a generalization of Eq. (1). viz.. 

U(t ) = fl(f)cos[0(f) - wt], (3) 

where the amplitude a(t) and the phase <f>(t) now depend on time, and generally fluctuate 
randomly. With the help of elementary Fourier analysis one can show (M&W, Section 3.1, 
especially pp. 99-100) that for quasi-monochromatic light alt) and cp(t) will vary very 
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Elementary coherence phenomena 



Fig. 1.1 Illustrating the behavior of the light vibrations U(t) of quasi-monochromatic 
light at a point in space. 


slowly over any time interval At which is short compared with the reciprocal bandwidth of 
the light, i.e. for time intervals 


At < 


2t r 
Aco 


(4) 


as illustrated in Fig. 1.1, where a(|) is indicated by dashed lines. 

We have implicitly assumed that we are dealing with a “steady-state” field. By this it is 
meant that the statistical behavior that underlies the fluctuations of the field does not 
change over the course of time. In the language of the statistical theory which we will 
briefly consider later (see Section 2.1), such a field is said to be statistically stationary. 

Let us now consider vibrations Ui(t ) and U 2 (t) at two points P, and P : in a quasi- 
monochromatic field: 


U i(£) = fl,(f)cos[<pi(f) - cot]. (5a) 

U 2 (t) = a 2 (Ocos[^,(r) - wt], (5b) 

Suppose that we superpose these vibrations at another point P. for example by placing an 
opaque screen across the field, with pinholes at P, and P 2 . Apart from some unessential 
geometrical factors which depend on the size of the pinholes and on the angles of incidence 
and diffraction, which we will assume to be small, the vibrations at the point of superposi¬ 
tion may be represented by the formula 

U(t) = U x (t) + U 2 (t) 

= ^{/icosf^jCO - 53r] + a 2 (7)cos[^ 2 (f) - cot - F 6] (6) 

where 6 is the phase difference introduced between the two beams propagating from P 1 to 
P and from P 2 to P, respectively. 


















































1.1 Interference and statistical similarity 


3 


The instantaneous intensity /(f) at the point P may be defined, in appropriate units, as the 
square of U(t): 


m = uHt) 

= /j(f) + I 2 (t) + /,-,(?). 


(7) 


vs here 


/[(/) = aflOcos 2 ^!* 1 ) - cuf], 

I 2 (t) = aj{t)cos 2 [<p 2 (t) - cot + <5], 


(8b) 


(8a) 


and 


I, 2 (t) = « 1 {cos[<^>!(£) + $ 2 {t) - 2 cot + 6] + cos[(Vf) ~ ~ <*]}• (8c) 

The expression (8c) follows from an elementary trigonometric identity. 

Because of the very rapid fluctuations of optical fields, one can never measure the instanta¬ 
neous intensity but only its average over some time interval — T<t<T which is large com¬ 
pared with the reciprocal bandwidth of the light, i.e. T » 1/A m. To avoid ambiguity, and also 
for reasons which will become evident later when we speak of ergodicity (Section 2.1). we will 
formally let T—* and define the time-averaged intensity by the formula 



(9) 


We attached the suffix t on the angular brackets to stress that the brackets refer to a time aver¬ 
age and also to distinguish it from another type of average which we will encounter later. 

Suppose that the amplitudes of the held at the points P\ and P 2 are effectively independ¬ 
ent of time (as is approximately true for the output of a well-stabilized single-mode laser) 
and also that they are equal to each other, i.e. that ctflf) = a 2 (t) = a = constant. Then, on 
taking the time average, we obtain from Eqs. (7) and (8) the following expression for the 
average intensity at the point of superposition: 



( 10 ) 


where 



(11a) 


</ 12 ), = fl 2 (cos[<^(r) - - /-],. 


(lib) 


In deriving the averages given by Eqs. (11) we used the facts that cos 2 [o(f) — cut]), = 1/2 
and (cos[^](r) + <p 2 (t) ~ 2cur + 5]), = 0. 
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The formula (10) shows that the average intensity at the point of superposition is the sum of 
the averaged intensities (/ 1 ), and (I 2 ), of each beam and of a term, {/ 12 )„ which represents the 
effect of interference between the two beams. Were the beams strictly monochromatic, the 
phases <f)\ and <p 2 would each be independent of time and the interference term (/ I2 ), would 
differ from zero, except at certain points, for which 6 takes a value that makes the cosine term 
vanish, ft is, however, evident from the expression (fib) that an interference term may be 
present even if the phases fft) and are not constant: in fact, even if they fluctuate ran¬ 
domly. Suppose, for example, that fft) and <p 2 rf) undergo random fluctuations but that 

<p 2 (t) - <st>i(r) = A (12) 

where 0 is a constant. Then evidently the interference term (/ 12 ), will be non-zero, except for 
special values of 6, just as in the idealized case of strictly monochromatic light. The simple 
relation (12) may be said to be an example of statistical similarity 1 between the vibrations at 
the two points. Thus we have shown that, in order to obtain interference, light need not be 
monochromatic It is necessary only that the interfering beams possess some statistical sim¬ 
ilarity, which is the essence of coherence. This concept then acquires a precise quantitative 
meaning in terms of the so-called degree of coherence of light. 


1.2 Temporal coherence and the coherence time 

We will now introduce by means of two well-known experiments some elementary con¬ 
cepts relating to coherence. 

Suppose first that we divide a steady-state 2 quasi-monochromatic light beam from a source 
S into two beams in a Michelson interferometer (Fig. 1.2) and that the two beams are brought 
together after a path delay e A t (c is the speed of light in vacuum) has been introduced between 
them. If A t is sufficiently small, interference fringes are formed in the detection plane 3. Tire 
appearance of the fringes is said to be a manifestation of temporal coherence between the two 
beams, because the contrast of the fringes depends on the time delay A t introduced between 
them. It is known from experiments that interference fringes will be observed only as long as 


Af 


27T 

Aw 


( 1 ) 


1 A precise meaning of the concept of statistical similarity can be given in the framework of the theory of station¬ 
ary random processes. The concept is relevant not only in connection with coherence but also in connection 
with polarization [H. Roychowdhury and E. Wolf, Opt. Commun. 248 (2005) 327-332.] 

By steady state we mean here that the averaged intensity 

i r T 

— I /(?„ + r)dr 
2 T J t 0 

taken over an interval 2T » 1/A<a is independent of the choice of r 0 . A more precise definition involves the 
concept of statistical stationarity which we will encounter in Section 2.1. Whilst most ordinary laboratory 
sources and sources found in nature (e.g. the stars) are of this kind, laser pulses do not belong to this category. 
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Fig. i .2 Illustrating the concept of coherence time of light by means of the Michelson 
interferometer. M\ and M 2 are mirrors. M a is a beam splitter (semi-transparent mirror). For 
the sake of simplicity, a compensating plate and a collimating lens system are not shown. 

To obtain interference fringes, one of the mirrors must be tilted with respect to the axis. 

This result may be understood theoretically by considering the interference pattern formed 
by each spectral component of the light and estimating the time delay for which the indi¬ 
vidual intensity patterns get out of step, eventually canceling out. This time delay is called 
the coherence time of the light and the corresponding path delay 


M = 


2tvc 

Aw 


AA 


( 2 ) 


A being the mean wavelength and AA the effective wavelength range. This quantity is 
called the coherence length of the light. 

We will illustrate the formula (2) by simple examples. Consider first thermal light, such 
as would be generated by incandescent matter or gas discharge with a broad spectrum. The 
bandwidth Am is typically of the order of 10 8 s _1 . the coherence time A t is of the order of 
10~ 8 s and its coherence length At ~ 27r X 3 X 10 1 cms _1 /10 8 s“ ~ 19m. 

Let us compare this result with light generated by a well-stabilized laser, with 
Aw ~ 10 4 s -1 . The coherence time of such light is of the order of 10~'s and its coherence 
length Af ~ 190km, i.e. 10 4 times longer. 


1.3 Spatial coherence and the coherence area 

Another experiment which elucidates some aspects of coherence is Young's interference 
experiment. Suppose that quasi-monochromatic light, assumed for now to originate in a 
thermal source S , illuminates two pinholes in an opaque screen A (Fig. 1.3). For simplicity 
we assume that the arrangement is symmetric, with the source having the form of a square 















6 


Elementary coherence phenomena 



Fig. 1.3 Illustrating the concept of spatial coherence by means of Young's interference 
experiment with steady-state light. 


of side A.v. If the pinholes at Q\ and Q 2 are sufficiently close to each other, interference 
fringes will be observed on a detecting screen B throughout a neighborhood of the axial 
poinl provided that 


Av M < A. (1) 

where 2 A# is the angle which the line Q\Q 2 subtends at the source. A rough elementary 
derivation of Eq. (1) may be obtained by considering the intensity distribution at the obser¬ 
vation plane tSto arise from the superposition of intensities of independent interference pat¬ 
terns formed by light from different elements of the thermal source. The relation (1) then 
follows on requiring that the individual interference patterns remain approximately in step. 

If the plane BL of the pinholes is at a distance R from the source plane, the fringes will be 
observed in the neighborhood of the axial point P„ in the observation plane % provided that 
the pinholes are situated within an area Aof size 

AH ~ (R A6) 2 = ^A \ (2) 

where S = (Av) 2 is the area of the source and the relation (1) was used. This area is said to 
be the coherence area of the light in the plane A around the axial point Q {) . We see that it 
increases quadratically with the distance R from the source plane. However, the solid angle 
AQ which the coherence area subtends at the source is independent of the distance, being 
given by the formula 



S 


(3) 
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Although the preceding discussion has been contined to light from a thermal source, the 
concept of coherence area applies more generall) However, the formula (2) applies only 
when the pinholes are illuminated by thermal light. 

It is sometimes useful to express the coherence area in an alternative form that involves 
the solid angle 


AW = 



(4) 


which the source subtends at the axial point Q 0 . It follows at once from Eqs. (4) and (2) that 


AM 


A 2 

AW' 


(5) 


It is of interest to compare the coherence areas generated by various thermal sources. Let 
us consider first a planar thermal source of area 1 mm 2 emitting quasi-monochromatic ther¬ 
mal light of mean wavelength A = 5,000 A (500 nm) and illuminating a plane if parallel to 
it. at a distance R = 2 m. According to formula (2) the coherence area in that plane is 

AM = (2m — X (5 X 10 -7 m) 2 = 1 mm 2 . (6) 

10 6 m 2 


Let us compare this result with the area of coherence of filtered sunlight illuminating the 
Earth's surface. Suppose that the sunlight is passed through a filter with a narrow passband 
around A = 5,000 A. The angular radius that the Sun’s disk subtends at the surface of the 
Earth is approximately a ~ 0°16 'm 0.00465 radians. Hence, if we neglect limb darken¬ 
ing. the solid angle AW that the Sun’s disk subtends at the Earth's surface is. therefore. 

AW ~ 7m 2 «=> 3.14 X (4.65 X KT 3 ) 2 «* 6.79 X 10~ 5 sr. (7) 


Hence, according to Eq. (5). the coherence area of sunlight on the Earth's surface is 


AM 


(5 X 10 5 cm i 2 
6.79 X 1(T 5 


~ 3.68 X 10 3 mm-. 


( 8 ) 


Hence the linear dimension of the coherence area on the surface of the Earth of filtered sun¬ 
light is of the order of (3.68 X 10~ 3 ) 1/2 mm ~ 0.061 mm. It is of some historical interest to 
mention that this value agrees with an estimate made in the 1860s by the French scientist 
E. Verdet. who used only a primitive notion of the concept of coherence. 

Let us compare the coherence area of sunlight on the Earth’s surface with that produced 
by a distant star. For this purpose we recall that according to Eq. (5) the coherence area is 
inversely proportional to the solid angle AW which the source subtends at the axial point 
<2 0 of the plane where the coherence area is to be estimated. Since the angular diameter of 
a star when viewed from the Earth’s surface is many orders of magnitude smaller than that 
of the Sun. one must expect that the area of coherence of star light at the Earth's surface 
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will be very much greater than that of the sunlight. As an example, let us consider the coher¬ 
ence area formed on the Earth’s surface by the star Betelgeuse (a-Orionis), which was the 
first star whose angular diameter was determined by an interferometric technique (dis¬ 
cussed in Section 3.3.1). It was found to have an angular diameter of 2 a — 0.047 seconds 
of arc (2.3 X 10 7 radians). The solid angle which this star subtends at the Earth’s surface 
is, therefore, AfT = 7 tq 2 — 4.15 X 10” l4 sr. Hence the coherence area on the Earth’s sur¬ 
face of the light from this star, when passed through a filter which transmits a narrow band 
around the wavelength A = 5,000 A, is. according to Eq. (5), 


. (5 X IQ- 7 m) 2 

4.15 X 10 14 sr 


6 nr. 


(9) 


It is evident from this estimate that there is appreciable “statistical similarity’’ (spatial coher¬ 
ence) between vibrations in the filtered light from the star reaching the Earth’s surface up to 
a separation of about V6 nr — 2.45 m. Many stars have angular diameters which are 
appreciably smaller than that of Betelgeuse and, consequently, light reaching the Earth 
from such stars will be spatially coherent over much larger areas. 

These results make it clear why stellar images formed in the focal plane of a telescope 
have the appearance of diffraction images formed by spatially coherent light: for our analy¬ 
sis shows that there exists a high degree of correlation in the star light entering the aperture 
of the telescope over areas which are generally much larger than the area of the aperture. 


1.4 The coherence volume 


Let us now consider a field that is approximately a plane, quasi-monochromatic, steady- 
state (i.e. statistically stationary) wave. The right-angled cylinder whose base is the coher¬ 
ence area in a plane normal to the direction of propagation of the light and whose height is 
the coherence length may be called the coherence volume (Fig. 1.4). It occupies a domain 
of space whose volume is 


AV = Af AA 


( 1 ) 


The coherence length At is given by Eq. (2) of Section 1.2 and the coherence area, for ther¬ 
mal light, is given by Eq. (2) of Section 1.3. On substituting from these equations into 
Eq. (1) we obtain the following expression for the coherence volume of thermal light: 



A 

AA 


A 3 


(2a) 


or, if Eq. (4) of Section 1.3 is used. 


AV 


AO' 


AA 


A 3 . 


(2b) 
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Fig. 1.4 Illustrating the concept of coherence volume AV in a steady-state held of a 
thermal, quasi-monochrontatic, approximately plane wave. 


Let us estimate the coherence volume for the examples we considered a moment ago in 
connection with the coherence area. We will assume that in each case the effective wave¬ 
length range of the filtered light A A = 5 X 10 -4 A with mean wavelength A = 5.000 A. In 
this case the coherence length is, according to Eq. (2) of Section 1.2, about 5 nt. For a pla¬ 
nar thermal source of area 1 mm 2 we found fEq. (6) of Section 1 3] that the area of coher¬ 
ence around the axial point Q 0 in a plane parallel to the source and at a distance R = 2 m 
from it has the size AA = 1 mm’. Hence, according to Eq. (1) of Section 1.4 the coherence 
volume 


AV ~ (10 ’em) 2 X (5 X 10 : cm) = 5cm'.. (3) 

For filtered sunlight reaching the Earth we found [Eq. (8) of Section 1.3] that AA ~ 
3.68 X lCT 3 mmr and hence, using Eq. (1), the coherence volume of sunlight on Earth’s 
surface is 


AV ~ 3.68 X 10“ 3 mnr X 5 X lO'nrm ~ 18 mm 3 . (4) 

For filtered light from the star Betelgeuse, we found that A A — 6 m 2 [Eq (9) of Section 1.3]. 
so that 


AE~ 6m 2 X 5m = 30m’. (5) 

The formulas (2) pertain to radiation from a thermal source. However, the concept of 
coherence volume applies much more generally. Consider, for example, light from a com¬ 
mon-type laboratory helium-neon laser. Let us assume that the cross-section of the laser 
beam is 1 mm - and that the mean wavelength of the light is A = 6 X 10 cm. Over a short 
time interval, of the order of a few seconds, one can easily achieve stability that ensures a 
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narrow bandwidth Aw ~ 10 6 Hz, implying an effective wavelength range AA ~ 1.2 X 
1CT 1 ’em. According to Eq. (2) of Section 1.2 the coherence length during such a short time 
interval will be of the order of 


AC = 2 it X 


3 X 10 8 ms" 1 
10 6 s“‘ 


** 1900 m. 


( 6 ) 


Assuming that the laser beam is spatially completely coherent over its whole cross-section 
(which would be the case if the laser operated on a single mode). Eq. (1) then implies that 
the coherence volume is 

AV~ 1900m X 10~ 6 m 2 = 1.9 X 10~ 3 m 3 . (7) 

The concept of coherence volume has a counterpart in the quantum theory of radiation, 
where it represents the so-called cell of phase space. It is the domain in a six-dimensional 
phase space, formed by the three position coordinates and three momentum coordinates, 
throughout which photons are indistinguishable. We will briefly discuss it later [Appendix 
1(a)]. 


PROBLEMS 


1.1 N quasi-monochromatic real scalar waves are superposed at a point P. The waves have 
constant amplitudes and the same mean frequencies. Derive an expression for the 
time-averaged intensity at the point P when the phases of the w'aves 

(1) are constant; 

(2) vary in unison, i.e. they differ from each other only by constants; 

(3) fluctuate randomly and independently of each other. 

1.2 The spectral density of blackbody radiation is given by Planck's law 


S(y) = 


87 rho 2 1 

c 3 e lwHk B T) _ | 


where h is Planck's constant, k B is the Boltzmann constant, c is the speed of light in 
vacuum. T is the absolute temperature and v = w/(2 tt) is the circular frequency. 

Estimate the coherence length of a beam of radiation whose spectrum is given by 
Planck's law and plot it as a function of k B T. 

1.3 Betelgeuse in the constellation of Orion w'as the first star w hose angular diameter was 
measured. It was found to be 0.047 seconds of arc. Determine the coherence area 
of the filtered light from this star on the Earth's surface, at the mean wavelength 
A = 5.75 x 10~ 5 cm. 
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2.1 Elementary concepts of the theory of random processes 

The elementary concepts that we have introduced rather heuristically in the preceding chap¬ 
ter can be made more precise with the help of mathematical methods developed for the 
analysis of random phenomena. This branch of mathematics is known as the theory of ran¬ 
dom processes or the theory of stochastic processes. In this introductory text we will not delve 
too deeply into the mathematical theorems and their proofs; rather we will try to get 
acquainted with only the basic concepts and results of the theory in order to help us to gain 
a deeper understanding of optical coherence and polarization effects. 

Let us consider a real field variable x(t). It may represent, for example, a Cartesian com¬ 
ponent of a steady-state electric field at some point in space, at time t. The exact behavior 
of -v(r) over the course of time cannot be predicted, because, as we noted in the preface, any 
realistic field always undergoes random fluctuations. 

Suppose that ,v(r) is measured in a series of very similar experiments and that 1 v(r), 2 .v(f), 
3 .v(f),... are the outcomes of such experiments, as indicated in Fig. 2.1. We then speak of 
an ensemble of realizations or of an ensemble of sample functions of the random function 
,x(t). Of course, such measurements cannot be performed at optical frequencies because of 
the rapidity of variation of the field, but conceptuall;, the existence of the ensemble is clear 
and experiments of this kind can be performed w ith radiation at lower frequencies. 

We have already encountered the concept of a time average [Eq. (9) of Section 1.1]. For 
a typical realization k x(t) of the ensemble, the time average is defined as 



0 ) 


One may define, in a similar w av, the time average of a deterministic function F(.\). w here 
x is a sample function k x(t) of the ensemble of x: 



( 2 ) 
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Fig. 2,1 An ensemble of realizations (sample functions) 4(0 (j - 1.2, , .of a ran¬ 
dom variable x(t). 
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For example, if F(x) = x 1 then 

= Lim-i- f\ J*x(r)] 2 dr. (3) 

In the theory of random processes one also detines another kind of average, called the 
ensemble average or the expectation value, which may be introduced as follows. Consider 
first the averaged sum 

— ['.Y(f) + 2 x(t) + ■■■ f A x(f)], (4) 

N 

and let us formally proceed to the limit as N —>* oo. One then writes 

| N 

(x(t)) = Lim — *40, (5) 

N-><» N fT) 


which is one form of the ensemble average (average over an ensemble of realizations) of 
x(t), denoted by the angular brackets with subscript e. 

In a more abstract way one defines the ensemble average by means of an integral rather 
than by a sum. For this purpose one introduces the probability density, p,(x, t), with the fol¬ 
lowing meaning. The quantity p\(x, f)dv represent the probability that the random field vari¬ 
able will take on a value in the range (x, x + dv) at time t. The probability density /?, can be 
estimated from an ensemble of realizations of x (Fig. 2.1). Suppose that among a large 
number N of realizations the value of x is found within the range (x, x + dv) n times. Then 

P] (x,t)~^. (6) 

The ensemble average of x(t). which may be regarded as a natural generalization of the def¬ 
inition (5), is then defined, in view of Eq. (6). as 

(x(r)) e = / v/),(x. ndv. (7) 

where the integration extends over the whole range of values which x can take. We stress 
that, although the random variable x is time-dependent, the variable x of integration under 
the integration sign on the right-hand side of Eq. (7) is taken to be independent of time. 

The formula (7) defines the simplest kind of ensemble av erage, namely the mean of x(r). 
We will also need more general averages, for example, the ensemble average of F[x(/)], where 
Fix) is again a deterministic function of x such as. for example, r 2 or sin x. The ensemble 
average is then defined by an obvious generalization of the definition (7), namely as 

(F[x(t)]) c = J F(x)p i (x. ridv. (8) 

Before proceeding further with the account of the basic aspects of the theory of random 
processes we mention the form of the probability densities for thermal light and for laser 
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light. For thermal light such as generated, for example, by incandescent matter or a gas dis¬ 
charge, the probability density of the field variable, U say, is 1 


P t (U) = 


1 p-u-kii) 


(9) 


where (I) = (U 2 ) is the average intensity, the average being taken over the statistical 
ensemble. On the other hand, the probability density p } of the output of a single-mode laser 
is well approximated by the expression 


P { W) 


— -- when \U\ < yj{I), 

xjj)- U 2 

0 when |€/| > yf(J), 


( 10 ) 


as was first shown by L. Mandel ‘ 

Plots of the two probability densities (9) and (10) are shown in Fig. 2.2. We note some 
basic differences between them. For thermal light, the value U = 0 is the most probable 
value, whereas for the output of a well-stabilized laser it is the least probable value, the 
most probable value being the square root U = of the stabilized intensity of the 

laser output. Also, for thermal light there is always a finite (though possibly very small) 
probability that U will take on any value, whereas for the output of a single-mode laser 
there is zero probability that |t/| > J{1) . 

Retuming to the general case, the probability density p |(.v, t) makes it possible to define 
some ensemble averages, but is by no means adequate to characterize the statistical prop¬ 
erties of a random function. This is because p fix, t) depends on only one time argument. It 
cannot, therefore, provide answers to questions concerning the statistical behavior of a ran¬ 
dom variable at several instants of time (;, t 2 , .. ., t,„ and consequently about averages of 


1 This result is a consequence of a general theorem of probability theory, known as the central limit theorem (see 
M&W, Section 1.5.6), It asserts that, under very general conditions, the probability distribution of the sum of N 
independent, or w'eakly dependent, random variables tends, with increasing N, towards a Gaussian distribution. 
If we identify the random variables with the contributions to the total field arising from the different source ele¬ 
ments (atoms), which at usual laboratory temperatures radiate independently by the process of spontaneous 
emission of radiation. Eq. (9) follows. 

" L. Mandel in Quantum Electronics.. Proc. Third International Congress, N. Bloembergen and P. Grivet eds. 
(Columbia University Press. New York: Dunod, Paris. 1964). pp. 101-109. See also L, Mandel and E. Wolf, 
Rev. Mod. Phys. 37 (1965). 253. The same distribution was derived from a somewhat different model by J. W, 
Goodman. Statistical Optics (J. Wiley, New York. 1985), Section 4.4.1, _ 

The singular behavior of the probability density (10) when l£/| = f(I) is a consequence of the rather 
idealized model of the output of a single-mode laser. When a more realistic model is used, which takes into 
account the presence of phase fluctuations caused, for example, by thermal fluctuations or vibrations of the 
mirrors at the ends of the laser cavity, the probability distribution will not be infinite when |t/| = M) but 
rather it w ill have a sharp maximum. Plotted as a function of arg (./the probability will then have the well-known 
doughnut shape [see, for example. O. Svelto. Principles of Lasers, third edition (Plenum Press. New York, 
1989). Section 7.4], 
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Fig. 2.2 The probability density Pi(U) of the field variable U for light from a thermal 
source and from a single-mode laser. [Adapted from L. Mandel in Quantum Electronics , 
Proc. Third International Congress, Eds. N. Bloembergen and P. Grivet (New York, 
Columbia University Press and Paris. Dunod, 1964, pp. 101-109).] 


products such as ,v(r]).v(t 2 ). To characterize a random process one needs a sequence of prob¬ 
ability densities 

P](a,: t i) r Pz(x u x 2 i tu t 2 ), P3 (a'i, x 2 , -r 3 ; t ,, f 2 , f 3 )- 

The quantity p 2 (x ], x 2 , t l , ii)dv, dv 2 represents the probability that the variable x will take on 
a value in the range (aq, .v, + dr,) at time t\ t and a value in the range (x 2 , x 2 + dv 2 ) at time 

r-. The higher-order probability densities p 3 , p 4 .have a similar meaning. One can then 

answer, for example, questions such as "what is the ensemble average of the product 
.vU|).v(f 2 )?” It is given by the formula 

(x(t l )x(t 2 )) e = f f x v x 2 p 2 (. Vj, ,v 2 ; t v uidv, dv 2 . (11) 

We have assumed up to now that the random function is a real function of t. Later v> e will 
frequently encounter complex random functions of t, sa\ 

z(0 = x(t) + iy(t), (12) 

where x(t) and y(l) are real. The concepts that we have just been discussing may readily be 
generalized to such situations, as we will now indicate. 

The statistical properties of a complex random process z(t) are characterized by a 
sequence of probability densities 

P\(zu h), p 2 {zi, z 2 ; h, t 2 ), p 3 {z i, c 2 . z 3 ; L- U. t 3 ) - 
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The first one, when multiplied by d 2 z = dbc : dy ; . represents the probability that at time t\ 
the random variable z will take on a value represented by a point located in the element 
d 2 £] = djqdvi around z\ = Xj + ivj in the complex -plane (see Fig. 2.3). The second 
probability density, p 2 (z i, Z 2 , t\. h)- when multiplied by the product d 2 ^ d 2 j 2 = drj dy, d.v 2 
dy 2 , represents the probability that the random variable i will take on a value represented by 
a point in the element d 2 ^ = dt) dyj around the point c, = + by at time t\ and a value 

represented by a point in the element d 2 ; 2 = dv 2 dy 2 around the point = x 2 + iv 2 at time 
h. The higher-order probability densities p 3 , p 4 .. . . have similar meanings. 

Ensemble averages involving complex random variables are defined by an obvious gen¬ 
eralization of those involving real variables. For example 

(s*(f,)*(/ 2 )) = / I \z 2 p 2 {z v z 2 \ t v ? 2 )d 2 " 1 d 2 z 2 , (13) 

where asterisks denote the complex conjugate and the integration extends over all possible 
values of z\ and z 2 which the complex variable e can take on. 

We spoke earlier about a steady state. The concept of a steady-state process may be 
regarded as a non-technical term for what is known as a stationary 1 random process. By this 
is meant a random process whose properties do not depend on the origin of time: more pre¬ 
cisely, a random process for which all the probability densities p\,p 2 , p 2 ,. . . remain invari¬ 
ant under translation of the origin of time. Expressed mathematically, this means that 

Pn(Z\> 22, • • •• Z»; h + T. t 2 + r. t„ + r) = P„{z u z 2 . z„; t u t h .. ., t n ) (14) 

for all values of r and for all positive integers n. Examples of such a situation are light 
vibrations at a point in the focal plane of a telescope forming the image of a star (ignoring 
the possibly finite life-time of the star). 

For a real stationary' field U(t) the average intensity, {/(?)}, will be independent of time, 
because 

</(0) e = (U 2 (t)) e = f U 2 p l (U. t)dU. (15) 



dv 


Fig. 2.3 Illustrating the meaning of the element d : c relating to the probability/>,(;, t)d z z 
(d : c = dv dy). 
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and pi does not depend on t because it has to remain unchanged under translation of the ori¬ 
gin of time. Similar remarks apply to a complex stationary random held V(t), for which the 
average intensity {I(t)) e = {V*(t)V(t)) e . 

We are now familiar with two kinds of averages, the time average and the ensemble aver¬ 
age Fortunately it turns out that, when one is dealing with stationary fields, these two aver¬ 
ages usually have the same value. This is the essence of so-called ergodicity. a concept 
which we will now briefly discuss. 

2.2 Ergodicity 

In a broad sense a statistically stationary process is said to be ergodic if the time average, 
taken over the interval — < t < x, of any deterministic function T(£;) of a typical real¬ 

ization £ = k x(f) of the random process is equal to the corresponding ensemble average. 


i.e. if 


(F[ k x(t)]) r = 


( 1 ) 


and. more generally, if F{y b y 2 , . . y n ) is a deterministic function of several variables v,. 

vy, . . v„, then, for an ergodic ensemble 



( 2 ) 


Such equalities may seem to be somew'hat surprising at first sight since, for example, in Eq. (1) 
the expression on the left-hand side seems to depend on the particular choice k x(t) of a realiza¬ 
tion of the random process and is independent of time, w hereas the expression on the right- 
hand side does not depend on any particular realization and seems to depend on time. The 
following example may perhaps indicate w hy the equality of the tw o types of averages might 
be expected. 

Consider a particular realization k x(t) of a stationary random process, such as that shown 
in Fig. 2.4(a). Let us imagine that we divide it into long segments, w hich we label as ®, ®. 

(3).each of duration 2T and let us place them under each other, as in Fig. 2.4(b) For an 

ergodic ensemble we may expect that all the individual segments w ill be representative of 
the ensemble of x{t) of which k x(t) is a member. Since the ensemble shown in Fig. 2.4(b) 
w as derived from the single realization k x(t) of the original ensemble, one might expect that 
the statistical information which can be deduced from the single realization, (a), and from 
the ensemble of realizations, (b), w ill be the same. 

We will assume from now’ on that we are dealing with ensembles that are stationary and 
ergodic. Consequently, it will not be necessary to distinguish between time averages and 
ensemble averages and we will, therefore, omit from now on the subscripts t and e on the 
angular brackets that denote expectation values. 




k x(t) 



ensemble of a stationary random process x(t) is contained in a single sample function 
*v(?) of the process. Dividing such a sample function into many parts. ®, 

L(a)] provides a valid ensemble [(b)] of realizations (sample functions) of x(r). 
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2.3 Complex representation of a real signal and the envelope 
of a narrow-band signal 

In the usual treatments, when one deals with a real monochromatic wavefield, it is often 
convenient to associate with it a complex monochromatic wavefield (see. for example, 
B&\\, Section 1.4.3). The use of the associated complex field simplifies calculations, espe¬ 
cially if they involve averages of quantities that are quadratic in the field \ariable. e.g. the 
intensity. There is a similar procedure, which we will now describe, which is useful when 
one is dealing with real random fields such as, for example, the wavefield generated by a 
thermal source. 

Let U(t) be a fluctuating real quantity, which might represent, for example, a Cartesian 
component of the electric field at some point in space. Let us express it in the form of a 
Fourier integral, 


U(t) = j‘ n(w)e ,M dw. (1) 

Since U(t) is real, it is equal to its complex conjugate and. using this fact, it readily follows 

that 

u(-w) = u(oj), (2) 

where the asterisk denotes the complex conjugate. This result implies that the negative fre¬ 
quency components contain no information that is not already contained in the positive 
ones. For this reason we may omit the negative frequency components, i.e, we may use in 
place of the function U(t) the function 

V(t) = J* «(n))e -lw, d<u. (3) 

\\t) is known as the complex analytic signal associated with the real signal U(t). The name 
derives from the fact that the function Vi t ), when considered as a function of complex t. 
may be shown to be analytic in the lower part of the complex t plane (M&W, Section 3.1). 
Moreover, one can also show that [M&W. Eq. (2.1-57)] 

V(t) = ^[W r) (0 + iVW(/)L (4) 

where the functions W'jf) = U(1) and V" ] (i) form a so-called Hilbert-tranxform pair (some- 
.mes referred to as a conjugate pair); and, moreover, if V /U, (r) is a stationary random vari¬ 
able. then 


{[v^w) = \ty\mt))- (5) 

Since the expectation value often represents the average intensity, the formula (5) shows that the 
expectation value {V*(f)V(f)) may also be taken to be a measure of the average intensity. 
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Analytic signals are particularly useful in representing the envelope of narrow-band 
fields To see this let us first consider a strictly monochromatic real signal x{t) of frequency 
w 0 , i.e. 


x(t) = a — w 0 t). (6) 

where a and are the amplitude and phase factors of the signal, both being constants. We 
may rewrite Eq. (6) in the form 

,r(0 = + I$ 0 V“"', (7) 


where 


4 = a&*. ( 8 ) 

The formula (7) implies that the Fourier spectrum of x(t) consists of two delta functions, 
centered at frequencies cu 0 and —cu 0 [see Fig. 2.5(a)]. 

Next let us suppose that x(t) is not strictly monochromatic but rather that it has a narrow 
bandwidth Acu which is small relative to its mean frequency w, i.e. that 



co 


(9) 


Such a narrow -band signal is said to be qiiasi-monochromatic. A ty pical Fourier spectrum 
of such a signal is shown in Fig. 2.5(b). One often represents it in a form that is a general¬ 
ization of that given by Eq. (6), namely as 

,v(r) = a(r)cos[0(O - cot]. (10) 


The amplitude and the phase of such a signal are no longer constant as they are for a mono¬ 
chromatic signal, rather they vary with time. However, as is intuitively perhaps obvious and 
can be verified by elementary Fourier analysis, their variations are very slow compared 
with the variation due to the term lot. remaining essentially constant over an interval that is 
small compared with the reciprocal bandwidth 1/Acu. It should be noted, however, that the 
representation (10) is somewhat ambiguous, because it associates with a real function x(t) 
two real functions, a(t) and i/>(f), and such an association is not unique. The analytic signal 
representation makes it possible to avoid the non-uniqueness. One defines the complex 
envelope by associating with x(t) the analytic signal [cf. Eq. (4)] 

z(t) = +iv( 0 L (11) 


where y(f) is the conjugate (Hilbert transform) of x(t). We can express y(f) in the form 


y(t) = a(f)sin[V>(?) - cot]. 


(12) 
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(a) 

KC 



— lCq 

0 


KM | 



Fig. 2.5 The Fourier spectrum of a monochromatic signal of frequency a> 0 is shown in 
(a) and that of a real quasi-monochromatic signal of mean frequency w and bandwidth 
Aat <S io is shown in (b). 


W e now have two equations, namely (12) and (10). from which ait) and v(t) may be 
■rquely determined (apart from a trivial ambiguity of an additive phase factor 2?n~, where 
m is an integer). 

The function 

2 z{t) = [.v(0 + iyU)] 

= (13) 


may be identified w'ith a complex envelope of the narrow-band signal xit). 

We have so far regarded x(t) as a deterministic function. Suppose instead that if t) is a sta- 
aonary quasi-monochromatic random function whose spectral density has an effective 
tandw idth Atu which is much smaller than the mean frequency w. Such a random function 
an be represented by an ensemble of quasi-monochromatic signals. It is clear that its sta¬ 
tistical properties are entirely reflected in the statistical behavior of the amplitude function 
>atl and of the phase function iff). 
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2.4 The autocorrelation and the cross-correlation functions 

The two most important expectation values associated with a real random process x(t) are 
its mean 


m{t) = (x(0) (1) 

and its autocorrelation function (also known as the covariance function) 1 

R(t v t 2 ) - {.v(r,)*(4)). (2) 

Let us assume that the process is statistically stationary. The mean will then be independ¬ 
ent of the time argument t and the autocorrelation function will depend on the two time 
arguments r, and t 2 only through the difference r = t 2 — t\. We can then write R(t) in place 
of R(t !, t 2 ), i.e. we write 


R(r) = (x(t)x(t + t)). (3) 

This function provides, for a fixed r, a quantitative measure of the intuitive concept of sta¬ 
tistical similarity which we mentioned towards the end of Section 2.1. Let us plot a realiza¬ 
tion k x{t) and its “shifted version" k x(t + r) as shown in Fig. 2.6, it being assumed, for 
simplicity, that ( k x(t)) t = 0. Evidently R(t ) will have its maximum value when r = 0. As r 
increases R(t), in general, decreases because some of the positive values of.v(r) will be can¬ 
celled out by the negative values of x(t + r). In fact one can show that, if the process is 
ergodic. R(t) —* 0 as r—»x. It is clear that the effective width of R{r) is a measure of the 
time over which there exist correlations (some statistical similarity) between x(t) and 
if t + t). The effective width of Rtr) is evidently a more precise measure of the concept of 
coherence time which we introduced earlier rather heuristically by an order-of-magnitude 
relation [Eq. (1) of Section 1.2], 

The autocorrelation function R(t ) has a number of useful properties, of which some fol¬ 
low immediately from its definition and others can be readily derived. The most important 
ones are 


R( 0 ) > 0 . 

(4a) 

R( t ) = R(t), 

(4b) 

|/?(r)| < R( 0 ). 

(4c) 


1 If the mean m ft) of x(t) is not zero, one often uses instead of the autocorrelation function t 2 ) the centered 
autocorrelation function 


Mr,, r,) = ([.rtf,) - m(f,)][.r(r,) - m(f 2 )]). 
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k x(t) 


k x(t + t) 



Fig. 2.6 Illustrating the significance of the autocorrelation function of a real random 
process x(t) of zero mean, as a measure of statistical similarity. As the separation r 
between the sample functions at two different instants of time t and ; + r gradually 
increases, *.vft) and k x(t + r) will acquire quite different values for each t, and the posi¬ 


tive and the negative contributions will tend to cancel out in their product. 

There is a theorem which asserts that the Fourier transform of the autocorrelation func¬ 
tion is necessarily non-negative. This theorem, which is relevant to the definition of the 
spectrum of a stationary random process (see Section 2.5) is known as Bodmer's theorem 
M&W, p. 18). 

When the random variable is complex, we write z{t) instead of x(t) and we define its 
autocorrelation function in a somewhat similar way to how it is defined for a real process 
xit) If the complex process is stationary and of zero mean, we have in place of Eq. (3) 


(5) 


R(t) = (z'(t)z(t + r)), 


where the asterisk denotes the complex conjugate. The inequalities (4a) and (4c) still hold. 
Instead of the formula (4b) one has 


R(~t ) = R\t ). 


( 6 ) 


Later we will also need a generalization of the autocorrelation function for situations 
involving two random processes, say ?,(r) and * 2 (f). These processes may represent, for 
example, the field variable at two points P { and P 2 in space. Assuming that the processes 
are also jointly stationary, i.e. that the joint probability of C|(r) and : 2 (t) is invariant with 
respect to translation of the origin of time, the measure of their correlation is the so-called 
cross-correlation function 


R 12^ T ) = (s’ (0^2 (’ + r )}‘ 


(7) 


It has the following properties: 



(8a) 
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and 


«, 2 (-t) = /?;,(>). (8b) 

Among the various parameters which characterize a random process, the mean m(t) and the 
autocorrelation function R(t h t 2 ) are particularly useful When the mean is independent of time 
and the autocorrelation function depends only on the time difference t 2 — ty one says that the 
process is stationary’ in the wide sense. Obviously a process which is (strictly) stationary (i.e. 
for which all its probability densities remain invariant under the translation of origin of time) 
is also stationary in the wide sense, but the converse is, of course, not necessarily true. 


2.4.1 The autocorrelation function of a finite sum of periodic components 
with random amplitudes 

We will now examine the behavior of the autocorrelation function of a class of stationary 
random processes w'hich has a bearing on the concept of the spectrum of a stationary ran¬ 
dom process, which we will discuss in Section 2.5. 

Consider a random process represented by the ensemble 

z(t)= {**«)} (*=1,2,3 . M). (9) 

where each realization *z(t) is a sum of periodic components with frequencies w,. oj 2 , . . ., 

%- i-e. 


k. 


(t) 




+ k C 2 e~ im >' + ■ ■ ■ + *£ 


M 


( 10 ) 


the k Q„ (in = 1,2, 3,. . .) being (generally complex) random variables. We will assume that 
for each m the k Q m is of zero mean. i.e. that 


<*CJ = 0 (> n = 1. 2 . M), 


(ID 


and that the process is stationary, at least in the w'ide sense. 
The autocorrelation function of this random process is 


R(t) 


(z*(t)z(t + T)) 

M 

[m =1 
M N 

E E(CO« 

m = 1 «=1 


E C, e 

[n =1 


-iu: n [t+r) 


( 12 ) 


Since the process is stationary, the right-hand side must be independent of t and this can 
evidently be so only if 


(C C ) = 0 w hen n =* in. 


(13) 
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the periodic terms of different frequencies must be uncorrelated. The formula (12) then 

~..fies to 


M 


= £ <CO~ 


(14) 


m = 1 


*e >ee that the autocorrelation function R(t) is now the sum of periodic terms at the same fre- 
. Bencies «]. qj 2 . .. to M as are present in the sample functions (10) of the process and that the 
c«ctral component of each frequency w m is (apart from a proportionality constant depending 
■ the choice of units) proportional to the average “energy" (or the average “power") associ- 
med with each periodic component of the process. We note that the autocorrelation function 
I ides no information about phases of the periodic components of the sample functions. 
This process is evidently not ergodic because, as we mentioned earlier, for an ergodic process 
lt| -) —*• 0 as r — > x, a condition which is not satisfied by the expression ( 14). 

2.5 The spectral density and the Wiener-Khintchine theorem 

The Fourier spectrum, which provides information about the strength of the periodic com- 
jxments of various frequencies into which a function can be decomposed, is an important 
concept in much of physics and engineering. It can be applied to many deterministic func¬ 
tions encountered in practice. 

The situation is more complicated with functions that are sample functions of a station¬ 
ary random process. Such functions do not have a Fourier representation, because they are 
defined in the time interval —< t < 30 and do not approach zero as 1 —> x and t —» — x. 
Attempts to decompose such functions into harmonic components, i.e. into periodic com- 
-onents. have a long, rich and fascinating history, which cannot be discussed here. We will 
inly show how their formal decomposition may be used to introduce an important concept, 
namely the concept of the spectrum of a (w ide-sense) stationary random process. We will 
gnore the mathematical subtleties encountered in a rigorous definition of the spectrum of 
such a process. 1 

Consider a wide-sense-stationary complex random process z(f) of zero mean. i.e. such that 


{*«> = 0 . 


( 1 ) 


Let us formally represent a typical realization as a Fourier integral 



( 2 ) 


\ rigorous treatment requires the use of distribution theory, also called generalized function theory. An excel¬ 
lent account of this subject is given in H. M. Nussenzveig. Causality and Dispersion Relations (Academic 
Press. New York. 1972). Appendix A. pp. 362-390. 
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Then, on inverting Eq. (2), we have 

k C(oo) = — f k z(t)e'" J 'dt 

2i r J _x 

and, consequently, w ith any two frequencies to and w ’. 

*$*(<y)*£(w'J = — l - r f f k z*(t) k i{t')edt '. 
(27t)“ J _® J -X 


(3j 


(4) 


Let us set r' = t + r, take the ensemble average of both sides and formally interchange 
ensemble averaging and integration. We then obtain the formula 


{£* («)£(*>'J) 



^(r)e 1(<0 w), e 1 “' T Al dr. 


(5) 


where 

R(t) = (z*(t)z(t + t)) 


( 6 ) 


is the autocorrelation function of the random process encountered in Section 2.4. The inte¬ 
gration over ( on the right-hand side of Eq. (5) can be carried out at once, being propor¬ 
tional to a Dirac delta function: 


a 


- | giiw to)t = _ 0J j 


(7) 


Equation (5) therefore implies that 

(C*(£o)C(w')} = S(io)6(io - to'), (8) 


where 


S(to) = J' R(r)e la,T dr. 


On taking the Fourier inverse of Eq. (9a) we have 

R(t) = J* S(to)e~ ,clT dto. 


(9a) 


(9b) 


The formulas (8) and (9) imply two things. The first shows that for a w'ide-sense-stationary 
random process the (generalized) spectral components of different frequencies are not cor¬ 
related: and that the “strength" of the “self-correlation" (for to' = to) is, according to Eq. 
(9a), equal to the Fourier transform S(to) of the autocorrelation function R(r) of the random 
process. S(<w), w'hich is formally defined by Eq. (8). is known as the spectral density (also 
known as the spectrum or power spectrum , or the Wiener spectrum) of the (stationary) ran¬ 
dom process z(t). 
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In order that S(w) agrees with the intuitive notion of a spectrum, it must, of course, be 
non-negative. That this is so is fairly clear from the definition (8) and follows rigorously 
from Bochner's theorem that was mentioned earlier (Section 2.4). applied to Eq. (9a). 

The results expressed by Eqs. (8) and (9) are natural generalizations of those encoun¬ 
tered in Section 2.4, in connection with a stationary random process consisting of a finite 
number of periodic components of different frequencies. 

The Dirac delta function on the right-hand side of Eq. (8) can be removed by integrating 
both sides of that equation with respect to co' over a small range (w — i- Aw < w' < 
at — \ Aw) and then letting A<u —» 0. This gives 



( 10 ) 


S(w) = Lim 


Aw—^0 


The integration over the small frequency range expresses a kind of smoothing. In fact 
smoothing is quite essential if one wishes to estimate the spectrum from a single realization 
of a stationary random process (see Fig. 2.7). 1 

The formulas (9) are basic relations of the theory of stationary random processes, 
expressing the so-called Wiener-Khintclune theorem. In words, this theorem expresses the 
fact that the spectrum S(w) of a wide-sense-stationary random process of zero mean and its 
autocorrelation function R(r)form a Fourier-transfonn pair. 

For later purposes we will need a generalization of the concept of the spectral density 
and of the Wiener-Khintchine theorem from a single random process z(t) to a pair of ran¬ 
dom processes s t (£) and s 2 (r) which are jointly stationary, at least in the wide sense. In 
optics Z\(t) and ~f t) often represent the fluctuating optical field (represented by complex 


Alternatively smoothing can be provided by ensemble axeraging. More specifically one can show [see, for 
example, S. Goldman. Information Theory (Prentice Hall. New York. 1955). p. 244] that 



( 10a) 


where 



(10b) 


is the Fourier transform of the truncated process 



when IfJ s T. 
when n > T. 


The ensemble average and the limiting process (T —* o») have to be taken in the order indicated in Eq. (10a), In 
fact the limit as T —* of the expression in the angular brackets does not. in general, exist. 
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Fig. 2.7 Schematic illustration of the spectrum S(to) of a random process, stationary at 
least in the wide sense, and the spectrum 

1 r T 

k g(w.T) =— / dt 

27T J -t 

obtained from the Fourier transform of a single realization k z(t) of the process, truncated 
over an arbitrary interval - T < t < T. In general, the true spectrum S(a>) cannot be deter¬ 
mined from a single realization of the process without smoothing. Taking the ensemble 
average, as indicated in Eq. (10a) of Section 2.5, provides one way of smoothing. 


analytic signals) at two points in space. By a strictly similar analysis to that which led to 
the derivation of formulas (8) and (9) one finds that 

(C,*(w)C 2 (w')) = W 12 (w)<S(m - w’X (11) 

where 

W |2 (w) = ±f R t2 { T )e^dT, (12a) 


and. taking the inverse, 

R ]2 (t) = J_^W u (LO)e~ ic,1T dw. (12b) 

In Eq. (11). Ci(w) and q 2 (w) are the generalized Fourier transforms of ;,(f) and Zjtf), respec¬ 
tively The function Vk, 2 ((a), defined by (12a), is know n as the cross-spectral density of z,(f) 
and z 2 (0 a nd the function 

^i 2 ( t ) = {z*yit)z 2 {t + t )) ( 13 ) 

is the cross-correlation function of the two random processes, whose main properties we 
noted in Section 2.4. 

We will refer to the pair of formulas (12a) and (12b) as the generalized Wiener-Khintchine 
theorem. 
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PROBLEMS 


2.1 Determine the analytic signal associated with the real signal 



1 when -1 < f s 1, 
0 otherwise. 


2.2 x(t) is a real signal, bandlimited to the range l&jjl < \a>\ < loq + A!. Show that the 
square a 2 = 4\z(t)\ 2 of its real envelope is bandlimited to the range — A < o> < A. 

2.3 Consider a real random process 


x(t) = a sin(wr + 9), 


where a and w are constants and 8 is a random variable. Find necessary and sufficient 
conditions on 9 to ensure that the process x(t) is stationary' in the wide sense. Give also 
a specific example of such a random variable 8. 

2.4 x(t) is a wide-sense-stationary real random process of zero mean. Show that the com¬ 
plex process z(t), obtained from x( t) by replacing each sample function by the associ¬ 
ated analytic signal, is also stationary in the wide sense and is of zero mean. 

2.5 A random process x(t) = A, where A is a random variable, which is uniformly distrib¬ 
uted on the interval (—1, 1). 

(a) Sketch some sample function of the process. 

(b) Determine the autocorrelation function of x(t) defined 

(i) by a time average; and 

(ii) by an ensemble average. 

(c) Is x(r) wide-sense stationary? Is it strictly stationary? 

(d) Is x(t) an ergodic random process? 

Your answers to (c) and (d) should provide justifications. 

2.6 x( t) is a real wide-sense stationary random process and v(r) is a process derived from 
x{t) by linear filtering, i.e. by a relation of the form 



Show that y(t) is also a wide-sense-stationary random process and dense relations 
between (i) the autocorrelation functions of the two processes and ii) the power spec¬ 
tra of the two processes. 

2.7 Determine the autocorrelation function of the random process 


x(t) = cosfwjf ~ (j>\) + a 2 cos(o> : r — oA. 


where a x , a 2 , w, and <u 2 are known constants and o, and cy are mutually independent ran¬ 
dom variables, each of which is uniformly distributed on the interval (0.2w). 

Determine also the power spectrum of the process. 
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2.8 z(t) is a complex stationary Gaussian random process of zero mean. Let 

, (,) = when W ^ T 

r it) when | f | > T 
and let £(w. T) be the Fourier inverse of Further let 

_ CUo. Dg(g>, T) 


S T (a>) 


IT 


and 


S(co ) = Lim{5' 7 -(w)). 

Show that, at any frequency oi for which S(w) + 0, the variance of S T {w) does not tend 
to zero at T —* 3C . What is the implication of this result for the problem of determin¬ 
ing the spectral density function of the process from one of its sample functions? 

2.9 The autocorrelation function of a real stationary random process .x(t) is 

R( t) = { [ -W t when \t\ < T, 

\ 0 otherwise. 

Determine the power spectrum of the process. 

2.10 Consider two random processes 

x(t) = li cos( wr) + v sin(wf), y(t } = —u sin(wf) + v cos(wr), 

where u and v are uncorrelated random variables with zero mean and with the same 
variance. 

(i) Find the cross-correlation function of the two processes. 

(ii) Determine whether the two processes are jointly stationary in the wide sense. 

(iii) Determine whether the two processes are ergodic with respect to their cross¬ 
correlation function. 
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Second-order coherence phenomena 
in the space-time domain 


3.1 Interference law for stationary optical fields. The mutual 
coherence function and the complex degree of coherence 

In Section 1.1 we pointed out that light need not be monochromatic in order to produce 
interference between two light beams; it is necessary only that the light vibrations in the 
two beams are “statistically similar” to each other. We will now introduce a quantitative 
measure of statistical similarity, with the help of Young’s interference experiment. 

Suppose that light, assumed to be stationary, at least in the wide sense, illuminates two 
pinholes Q 1 and Q 2 in an opaque screen A (Fig. 3.1). Let V(Q t , t) and V(Q 2 , t) represent the 
light vibrations at the pinholes at time t. For simplicity we consider V(Q. t) to be a (com¬ 
plex) scalar. Generalization to vector fields will be considered in Chapter 7. We will exam¬ 
ine the distribution of the average intensity in the neighborhood of some point P on a screen 
■S behind the screen A containing the pinholes. 




Fig. 3.1 Notation relating to Young's interference experiment with quasi-monochro- 
matic light. 
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Let R | and R 2 denote the distances Q\P and Q 2 P respectively. Since the light from the 
pinholes £3] and Q 2 takes the times 


t 


i 



c 



( 1 ) 


to reach the point P, V(P, t) is given by the expression 

V(P, t) = K t V(Q h t-t l ) + K 2 V(Q 2 , f - h), (2) 

where the factors K l and K 2 take into account diffraction at the pinholes and c in Eq. (1) 
denotes the speed of light in vacuum. It follow's from the Huygens-Fresnel principle (B&W, 
Section 8.2) that, if the angles of incidence and diffraction at the pinholes are sufficiently 
small, then 1 


K J ~°' = 1 ’ 2) - (3) 

where d;4| and dA 2 are the areas of the two pinholes and A is the mean wavelength of the 
incident light. 

Because the fluctuations of the light are very rapid, one cannot observe the time behavior 
of V(P, t), but one can measure the expectation value HP) = I(P, t ): = V* (P, t)V(P, tt) of 
the intensity, which is independent of time, because of the assumed stationarity. Using Eq. (2), 
it follow's that 


I(P) = \Ktf(V*(Q v t - t x )V(Q v t - h )) 

+ \K 2 \ 2 (VUQ 2 ,r - t 2 )V(Q 2 ,t - t 2 )) 

+ 2!%e{K*K 2 {V*(Q,,t - t,)V(Q 2 J - r ; ))}, (4) 

^denoting the real part. Using the assumption of stationarity. and also the fact that K t and K 2 
are purely imaginary' numbers, the expression (4) may be simplified and becomes 

HP) = IK^-RQO + \K 2 \ 2 I(Q 2 ) + 2^{|A*|r(!2 1 , 0* r, - r 2 )}, (5) 

where I(Qj) = (V*( Q,. t)V(Qj , r)) (J = 1. 2) are the averaged intensities of the light at each 
of the two pinholes and 


r (Q r Q 2 . T) = (V'(Q r r)V(Q 2 , t + r)) (6) 

is the cross-correlation function [Eq. (13) of Section 2.5] of the field at the pinholes at 
and Q 2 - In the present context T(0i, Q 2 , r) is called the mutual coherence function and is 
the basic quantity of the so-called second-order coherence theory, the term “second-order” 


It is convenient in the present analysis to define the K factors somewhat differently from how they were defined 
in B&W. We include here the terms d JfIR ; in their definition, thus making them dimensionless. 
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'dicating that Y is a correlation function involving a product of the field variable at two points. 
Later, in Chapter 7, we will encounter correlation functions involving a four-term product, 
which are the basic mathematical tools of the so-called fourth-order coherence theory. 1 

The formula (5) may be rewritten in a physically more significant form if we note that 
the first two terms on the right-hand side of Eq. (5) represent the average intensities of the 
ght that would be observed at the point P if only one of the pinholes were open. For exam¬ 
ple. if just the pinhole at Q\ were open, one would have K 2 = 0 and Eq. (5) then implies that 

I(Py^?y>(P) = |/r,j 2 /(Qi). (7a) 

Similarly, if just the pinhole at Q 2 were open, then 

HP) - ^(P) = I K 2 \ 2 I(Q 2 ), (7b) 


The formula (5) may, therefore, be rewritten as 


HP) = I {U (P) + I' 2 HP) + 2'fie 


ji {i HP)jP 2) (P)i'(Q r Q 2 ..t l -t 2 ) 


( 8 ) 


where 


7 ( 0 ! • Q 2 ~ t ) 


r (Q,.q 2 , t) 

Jhq 2 ) 

_ £ (Q r Q 2 , T ) _ 

VnOpGpO) Jt(q 2 .q 2 ,o) ’ 


(9) 

( 10 ) 


The formula (8) is one form of the so-called interference law for stationary optical fields. 
It shows that, in order to determine the (average) intensity at a point P in the observation 
plane ®, one must know not only the average intensities of the two beams at P but also the 
real part of the correlation coefficient -y (Q t , Q 2 . r), called the complex degree of coherence. 
of the light at the pinholes at <2i and Q 2 . In view of the significance of the cross-correlation 
function which we discussed earlier, the complex degree of coherence is e\ identic a precise 
measure of the statistical similarity of the light vibrations at the points Q , and Q 2 . 

It follows from the definition (10) and the property of the cross-correlation function 
expressed by Eq. (8a) of Section 2.4 that, for all values of its arguments. ~ is bounded by- 
zero and by unity in absolute value, i.e. that 

HQuQi,t)\ « 1. (11) 

The extreme value zero represents complete absence of correlation: the other extreme case, 
unity, represents complete correlation of the vibrations at Q and Q 2 . In the first case the 


This terminology is not quite uniform. What we call second-order and fourth-order coherence functions are 
sometimes called first-order and second-order coherence functions, respectively. 
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vibrations are said to be mutually completely incoherent ; in the second case they are said to 
be mutually completely coherent. 

A deeper insight into the significance of the complex degree of coherence may be obtained 
by expressing the interference law ( 8 ) in a somewhat different form, using the envelope rep¬ 
resentation of narrow-band signals which we discussed in Section 2.3. Let co be the mean 
frequency of the light and let us write 

? {Q v Q 2 , T) = b'(0 2 , Q z , (12) 

where 

»(£?i, Qj. r) = u>t + arg-KO,, Q 2 , r). (13) 

As we learned in Section 2.3, MQb Qi • r ) changes slowly over any time interval r which 
is short relative to the reciprocal bandwidth 1/Aw of the light. If we express 7 in the inter¬ 
ference law ( 8 ) in the form (12), we obtain for the intensity at P the expression 

/(/>) = /(b ( p) + /(2>(P) + 2 \liQv Qi ■ r )l cosfafij,, Q 2 , T) — 5], 

(14) 

where 

2.1T 

6 = wr = w (t 2 - fj) = ^(R 2 - A*,), (15) 

A 

A = lit cl to being the mean wavelength. 

The formula (14) is an alternative form of the interference law ( 8 ) for stationary optical 
fields. Let us examine its implications. 

Suppose that P 0 is some fixed point in the detection plane B. The average intensities / 1( (P) 
and P 2> ( P ) will change slowly with P in the neighborhood of P {) and so will Q 2 , r)|. 
Now, we have already noted that the phase factor a(Q], () 2 . t) will vary slowly with the 
time delay r = ( R 2 — R\)lc over r-intervals which are short compared with the coherence 
time of the light. Consequently, considered as a function of the phase delay 6. the last term 
in Eq. (14) will be a slowly modulated cosine term whose amplitude remains effectively 
constant over regions of the detection plane B for which the change in R 2 — A’j is small in 
comparison with the coherence length of the light. Under these circumstances interference 
fringes will be formed in the observation plane B. which will be essentially sinusoidal, with 
the amplitude and the phase of the sinusoidal patterns changing very slowly with position. 
This situation is illustrated in Fig. 3.2 for the commonly occurring situation when 
(P) = I 1 ] {P). In this case the interference law (14) reduces to 

I(P) = tPKP){ 1 + (7021, Q 2 , rJlcMtoCCj, 0 2 , t) - A]}. (16) 

This formula implies that when 1 7 ) = 1 (and only in this case) there are points in the fringe 
pattern where the average intensity is zero [namely points for which a{Q\, Q 2 . r) — 8 = 
(n + 4-)7r (n = 0, ± 1 , ± 2 , ...)]. i.e. where there is complete cancellation of the light by 





3.1 Interference for stationary fields 


35 


I 


I 




I 


2 p 1 ) 


fi 


a) coherent superposition 

(It! = i) 


(b) partially coherent superposition 

(0 < M < 1) 


(c) incoherent superposition 

(M-0) 


Fig. 3.2 The distribution of the average intensity in the neighborhood of an arbitrary 
point in the interference pattern formed by two quasi-monochromatic light beams of 
equal intensity F 1 ’, with the correlation between them characterized by a degree of 
coherence 7 . 


destructive interference. This is the case of complete coherence. In the other extreme case. 

= 0 , no interference fringes are formed at all, this being the case of complete incoherence. 
In all other cases, (0 < lyj < 1) fringes are formed but their contrast is lower than in the 
completely coherent case. One then says that the light at the pinholes is partially coherent. 

It is evident from Eq. (16) that the maxima and minima of the average intensity in the 
immediate neighborhood of any point P in the plane B of observation are given by the 
formulas 

/ max (P) = 2/ 1 »(P)[l + b(0„ Q 2 , t)|] (17a) 

and 

I m JP) = 2/»(P)[l - | 7 (Q,. Qi, t) |] . (17b) 

A useful measure of the contrast (i.e. of the sharpness) of the interference fringes is their 
so-called visibility, ‘ V, defined as 


%’(P) 


l (P) ~ I ( P) 

max v ' _ mm 

/ (P) + i (P) 

max v ' min 


(18) 


On substituting from Eqs. (17) into Eq. (18) we see at once that 

= i7(0i.o?^)i (i9) 

This formula relates a quantity that can be readily measured (the visibility V) to the more 
abstract concept of the modulus of the complex degree of coherence. The argument (the 
phase) of 7 can also be determined experimentally, by measurement of the location of the 
intensity maxima in the fringe pattern (B&W, Section 10.4.1). 

The complex degree of coherence p{Q\, Q\, r) provides a quantitative characterization 
of both temporal and spatial coherence, which were introduced in Sections 1.2 and 1.3 as 
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two distinct concepts. The temporal coherence of light at one point Q is now characterized 
quantitatively by 7 (Q, Q, t). For example, in the Michelson interferometer. Q is a point on 
the dividing mirror M 0 , shown in Fig. 1.2. and r represents the time delay introduced 
between the two interfering beams by displacing one of the mirrors, M x or M 2 , from the 
“symmetric position” by a distance cr/2. The visibility of the fringes in the detection plane 
B is then equal to \y(Q. 0. r)|. 

The spatial coherence of light at the two points (), and Q 2 is characterized by •yiQi, Q 2 , 
To), where r 0 is some fixed time difference r 0 = t 2 — t y (often taken to be zero) in Young’s 
interference experiment. As we just saw. the visibility of the interference fringes at a point 
P in the detection plane, for which the path difference Q 2 P — Q\P = ct 0 , is equal to 
|7(fil. 02> T 0 )j.- 

We will learn later (in Section 3.5) that only in special situations can temporal and spa¬ 
tial coherence of an optical field be treated independently of each other, because the mutual 
coherence function propagates according to precise differential equations, which relate its 
spatial and its temporal behavior. 

Returning to spatial coherence, we have already mentioned that the time delay r is often 
taken to be zero. This is generally the case when there is some kind of symmetry in the 
experimental set-up or when one is dealing with optical images near the axis of a centered 
optical system. In such cases the coherence effects are usually adequately described by the 
simpler correlation functions 



J(Q v Q 2 ) = r(0,.0 2 ,O) = (V’(Q v t)V(Q 2 ,l)) 

(20) 

and 

rto,, Q-,, 0) 

j(Q v Q.) = i(Q v Q 2 ,0) = -7 -- 

V r (0i- 0p 0) Vr(j2 2 ,0 2 ,o) 

(21a) 


p(Q r Q { ) p{Q 2 ,Q 2 ) 

(21b) 


■K.Q r Q 2 ) 

4^Qy) Jhq 2 ) 

(21c) 

where I(Qj) = {V*(Q p t)V(Qj. t )) = J(Q r Qf (j = 1 ,2) denotes the average intensity at the 
point <2j. These functions may be called equal-time coherence functions. Often J(Q U Q 2 ) 
is referred to as the mutual intensity and j{Q\, Qi ) as the equal-time complex degree of 
coherence. 

It follows from the envelope properties of narrow-band light (discussed in Section 2.3) that 


r (Q v Q 2 .t) - •/(£,. 

(22) 


7(01. 02' T ) ~ 2(0!, 02 )e _iwr , 

(23) 
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provided that |r| is small relative to the coherence time, i.e. provided that 



(24) 


3.2 Generation of spatial coherence from an incoherent source. 
The van Cittert-Zernike theorem 


We have learned from our elementary' treatments of the concepts of coherence area and 
of the coherence volume (Sections 1.3 and 1.4) that even a spatially incoherent source will 
generate a field which may be spatially coherent over large regions of space. Evidently in 
such situations spatial coherence is generated in the process of propagation. We will now 
give a simple intuitive argument indicating why this happens and we will then discuss this 
phenomenon quantitatively. 

Consider light emitted by two small steady-state sources S and S 2 . We assume that the 
light is quasi-monochromatic of mean frequency w and that the sources are statistically 
independent, so that there is no correlation between the beams of light which they gener¬ 
ate. We will compare the light vibrations at points some distance away from them. 

Let KjCPi, t) and V ] (P 2 , t) represent the field at points P and P 2 , respectively, due to the 
source .S', and let V 2 (P\, t) and V 2 (P 2 , 0 represent the field at these points due to th e source 
S 2 (Fig. 3.3). If the difference between the distances = SiP j and R u = S,P 2 is small 
compared with the coherence length (~2ttc/Aw) of the light, one obviously has 


V,(P,, t) t). 


(la) 



Fig. 3.3 Schematic illustration of the generation of spatial coherence from two small, 
uncorrelated sources. 
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Similarly, if the difference between the distances R 2 j = S 2 Py and R 22 = S 2 P 2 is small relative 
to the coherence length of the light one has 

V 2 (P 2 'f)*-Vnff U t). (lb) 

The total field at P j arises from the superposition of the field generated by the two sources 

(see Fig. 3.3) and hence is given by the expression 

V(P U t) = V,(P„ 0 + V 2 (P h r ). (2a) 

Similarly, the total field at P 2 is given by the expression 

V(P 2 , t ) = Vj(P 2 , 0 + V 2 (P 2 , f). (2b) 

Now, since Vi(Pi, t) and V 2 {P\, t) are generated by sources .S’, and S 2 which are statistically 
independent, these two disturbances will not be correlated. For the same reason Fj(P 3 , t) and 
V 2 (P 2 , t) will also not be correlated. However, the sums V^P], f) + V 2 {Pt) and V\(P 2 , t) + 
V 2 (P 2 , t) will be correlated because of the relations (1) and, consequently, 

V(P 2 ,t)^V(P l .t). (3) 

This conclusion is also evident from the sketch in Fig. 3.3, where the (essentially identical) 
wave trains F,(Pi, t) and V^Pj, 0 arriving at the points P, and P 2 , respectively, from .S’, are 
drawn as solid lines and the (essentially identical) wave trains V 2 (P\ -1) and V 2 (P 2 , t) arriv¬ 
ing at P\ and P 2 from the other source S 2 are drawn as dashed lines. Clearly, although the 
solidly drawn wave trains and the wave trains drawn as dashed lines may have completely 
different forms, the sum of the two wave trains arriving at P and the sum of the two wave 
trains arriving at P 2 will be similar to each other. Thus the fields at P t and P 2 . given by Eqs. 
(2a) and (2b). will evidently be strongly correlated (i.e. statistically similar). Hence we see 
that, even though the two small sources 5/ and S 2 are statistically independent, they will 
give rise to correlations in the field which they produce and the correlations are evidently 
generated by the process of propagation. 

Let us now examine quantitatively in a more precise mathematical language the correla¬ 
tions which are generated in the field produced by an extended planar source a of natural 
light. We assume that the source is statistically stationary, at least in the wide sense, and 
that it emits radiation in a narrow range of frequencies Aw around a mean frequency w. We 
also assume that the linear dimensions of the source are small relative to the distances 
between the source and the points Pi and P 2 and that the angles which the lines from each 
source point to Pj and P 2 make with the normal to the source are small (see Fig. 3.4). 

Let us imagine that the source is divided into elements doy, day, . . ., da,„ centered at 
points Sj, S 2 , ■ . ., S m and that the linear dimensions of the elements are small compared 
with the mean wavelength A of the emitted light. Let V m] (t) and V m2 {t) be the complex ampli¬ 
tudes of the field at the two points P, and P 2 in the field, contributed by the source element 
da„,. The total complex field amplitudes at the two points are then given by the expressions 

y^ = J2v m] (t), v 2 (t) = £v„ l2 (0. 


m 


(4) 
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Fig. 3.4 Notation relating to the derivation of the van Cittert-Zemihe theorem. 

It follows that the mutual intensity J(Q\, QA. defined by Eq. (20) of Section 3.1. is, in the pres¬ 
ent case, given by the expression 

J(P r P 2 ) a (V;«E 2 f t)) 

= E + EE (O' OV„ 2 (0). (5) 


Since we assumed that the source generates natural light, the contributions from the differ¬ 
ent source elements may be assumed to be uncorrelaled (mutually incoherent) and also of 
zero mean and hence 

EE<E,( f )v 2 (o) = EEOC .(0)(v„ 2 (0) = o. (6 ) 


If R ml and R m2 are the distance of the held points P t and P 2 from the source element drr m , then 

exp[-iw(t - R ml /c)\ 


V At) = A (t - R ,/c)- 

mP 7 //i' ml 7 


V ,(/) = A (t - R ,/c) 

m2 y 7 m v m2 7 


R 


exp[-iw(; - R ml /c ) 


R , 

ml 


(i) 


where the modulus |A W | of A m represents the strength and argA,„ represents the phase of the 
radiation emitted from the element d<j,„ of the source, c denoting the speed of light in vacuum. 
On substituting from Eq. (7) into Eq. (5) and using Eq. (6), it follows that 


j(p r p 2 ) = E(AE - - R m2 /o) 

m 

= J2«(^Jt-{R m2 -R,J/cl) 


exp[i co(R , - R )/c ] 


R ,R , 

ml ml 

exp[i g{R m2 - R ,, /, I 


( 8 ) 


where we have used the fact that the source is statistically stationary. If the path differences 
R m2 ~ R,„] I are small relative to the coherence length of the light, At ~ 2t/Aw, we may 
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also neglect the retardation terms {R m2 - R m \)/c in the averages, and the formula (8) then 
becomes 


j(p v p 2 ) = E(Kmjt)) 

m 


exp [—\oj(R 7 - R, )/c\ 


m2 

R ,R , 

ml m2 


(9) 


The average (A m {t)A m (t)) represents the intensity of the radiation emitted from the source 
element da m . In any practical case the total number of the source elements will be so large 
that one may consider the source to be effectively continuous. If we denote the intensity 
emitted per unit area of the source by I(S). then (A*,(r)A„,(f)} ~ 7(5',„)dcr m> and we may 
replace the summation in Eq. (9) by integration. We then obtain the following expression 
for the mutual intensity at the pair of points P\ and P 2 : 


n „i -R,) 

J(P V P 2 )= US)— — -AS, (10) 

(7 *^2 

where R j and R 2 are the distances between a typical source point 5 and the field points P t 
and P 2 respectively, k = ca/c = 2 tt/\ being the mean wave number of the light. 

If we recall the definition Eq. (21c) of Section 3.1 of the equal-time degree of coherence 
j(P\, Pi) and use Eq. (10), we obtain at once the following formula for the equal-time 
degree of coherence of the light at the two field points: 


where 


m^i) 


XX) XX) 


I(p j ) = J(p J .p J ) 


l 

r m 

X R 2 


e )k(R 2 -R l ) 

ns )— —d,s. 


R l R 2 


dS (j = 1,2) 


(ID 

( 12 ) 


being the (averaged) intensities at E, and P 2 . 

The formula (11) is the mathematical formulation of a central theorem of optical coherence 
theory, known as tile van Cittert-Zemike theorem. It expresses the equal-time degree of coher¬ 
ence j(P ], P 2 ) at two points P-, and P 2 in a field generated by a planar, statistically stationary, 
spatially incoherent quasi-monochromatic source a in terms of the averaged intensity distri¬ 
bution I(S ) across the source and the average intensities I(P t ) and I(P 2 ) at the two field points. 

The integral that appears on the right of Eq. (11) is of the same form as is encountered in 
quite a different connection, namely in the theory of diffraction at an aperture in an opaque 
screen. To see this analogy we recall that if a monochromatic spherical wave 

V(S, t) = U(S)e~ ia ”, (13a) 

with 


U{S ) = a(S)- 


3 -i kR, 


R, 


(13b) 










3.2 Spatial coherence from an incoherent source 


41 


Coherence 

van Cittert-Zernike Theorem 


Diffraction at an Aperture 

Huygens-Fresnel Principle 



points P | and Py. at P 2 due to the diffraction of a monochromatic 


j(P r P 2 ) 



gi/tifij) 


r,r 2 


dS 


spherical wave converging to the point Pp. 



incident diverging 

converging spherical 

wave secondary 


wave 


Fig. 3.5 The analogy between the van Cittert-Zernike theorem and the FIuygens-Fresnel 
principle. 


converging to a point P\ (see Fig. 3.5) is incident on an aperture .1 in an opaque screen, 
then the diffracted field at a point P z is, according to the Huygens-Fresnel principle [ B&W. 
Sections 8.2 and 8.3, especially Eqs. (1) of Section 8.2 and Eq. (17) of Section 8.3] given 
by the expression (with the time-dependent factor e _ '“" omitted) 



w here N’ is a constant, small angles of incidence and diffraction being assumed. 

Comparison of the van Cittert-Zernike theorem expressed b\ the formula (11) w ith the 
Huygens-Fresnel principle (14) brings into evidence the following rather remarkable anal¬ 
ogy; the van Cittert-Zernike theorem implies that, under the conditions stated, the equal¬ 
time degree of coherence j( P \, Pi) is given by the normalized complex amplitude at a point 
Pi in a certain diffraction pattern: namely that formed hx a monochromatic spherical wave 
of frequency &>, converging towards a point P t and diffracted at an aperture A in an opaque 
screen of the same size, shape and location as the incoherent source a, with the amplitude 
distribution across the aperture being proportional to the intensity distribution across the 
source. (See Fig. 3.5.) 
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Y 




Fig. 3.6 Notation relating to the derivation of the far-zone form of the van Cittert- 
Zernike theorem. 


We obtained this analogy from a formal comparison of the van Cittert-Zemike theorem 
of coherence theory with the Huygens-Fresnel principle of diffraction theory. A deeper 
reason for this analogy will become evident later (Section 3.5). 

Frequently the field points P, and P 2 are located in the far zone of the source, often in a 
plane A parallel to the source plane. The van Cittert-Zernike theorem then takes on a sim¬ 
pler form which we will now derive. For this purpose we choose a Cartesian coordinate 
system in the source plane and denote by (£, rj) the coordinates of a source point S. We also 
take a coordinate system in the plane Jl with origin O' and with the X. F-axes parallel to the 
rj -axes (see Fig. 3.6). If {Xj, Fj) and (X 2 , F 2 ) are the Cartesian coordinates of the field 
points and P 2 , respectively, in the plane A. then the distances P, = SP t and R 2 = SP 2 
are evidently given by the expressions 


Rf = (X, - O 1 + Oj - rj 2 + R 2 , 

R l = (x 2 - o 2 + (Y 2 ~ n) 2 + R 2 * 


(15) 


so that 


R, 


(x ] - if + (F, - n) 2 


2 R 


(16) 


R-, == R + 


(x 2 -Q 2 +(Y 2 - g) 2 


2 R 


On the right-hand side of Eqs. (16) we retained only the leading terms of the power-series 
expansions, which is justified if we assume that the points P, and P 2 are at distances from 
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die axis 00' which are small relative to R. It follows from Eqs. (16) that 


*2 -*J 


(xj +r 2 2 ) - ex , 2 + y?) 

2 R 


(x 2 - x,ic + (,t, -y,)r/ 

R 


(17) 


In the denominator of the integrands in Eqs. (11) and (12) /?, and R 2 may be replaced, to a 
cood approximation, by R It is also convenient to set 


x 2 -x. 


R 


= P, 



(18) 


and 


a-[(X 2 2 + t 2 ) - (X , 2 + r , 2 >] 
2 R 


(19) 


It follows that when the points P, and P 2 are in the far zone, Eq. (11) for the equal-time 
degree of coherence reduces to 


j(P r P 2 ) 


dr; 


( 20 ) 


This formula shows that, when the linear dimensions of the source and the distance between 
P\ and P 2 are small relative to the distance of these points from the source, the equal-time 
degree of coherence j(P j, P 2 ), apart from a phase factor, is equal to the normalized Fourier 
transform of the intensity distribution across the source. We may refer to Eq. (20) as the 
far-zone form of the van Cittert—Zeniike theorem. The phase t/\ defined by Eq. (19), ha s a si m- 
ple interpretation. It represents, to a good approximation, the phase difference kfPfPs ~ 
PjP]] between the field points P, and P 2l in the plane ;T and points P\ and P- at the same 
height from the 00' axis as P x and P 2 , located on the sphere centered at the origin O in the 
source plane and passing through the origin O' in the plane A (see Fig. 3.7). 

In its mathematical structure the expression on the right of Eq. (20) resembles the well- 
known expression of elementary diffraction theory for Fraunhofer diffraction at an aperture 
iB&W, Section 8.3.3). This was to be expected in view of the analogy between the van 
Cittert-Zemike theorem and the FIuygens-Fresnel principle, which we noted earlier. 

As an example let us determine the equal-time degree of coherence of the far field pro¬ 
duced by an incoherent, quasi-monochromatic circular source of radius a and of uniform 
intensity i 0 . The formula (20) gives, for this case. 


7(P 1 .P 2 ) 


e nt- I'M *g) 


m 0) 


( 21 ) 










44 


Second-order coherence in the space-time domain 



Fig. 3.7 Illustrating the significance of the phase i/j, defined by Eq. (19) of Section 3.2. 
It represents, to a good approximation, the phase difference klP'iPj ~ P'\P\]- 


where 


hf,g) = i 0 tn‘#**vd{d V . 


( 22 ) 


The integral on the right of Eq. (22) can readily be evaluated and one finds that (B&W, 
Section 8.5.2) 


I ( kp , kq) = 7 to 2 


2 J x | ka^Jp 2 + q 2 | 
ka ^Jp 2 + q 2 


(23) 


where 7, is the Bessel function of the first kind and first order, On substituting from Eq. (23) 
into Eq. (21) we obtain for the equal-time degree of coherence of the far field the expression 

27.(v) . 

j(P\'P-)) = ———e'^ (24) 

v 


where 


v = 



(25a) 
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Recalling the expressions for p and q given by Eqs. (18), we can express v in the form 


v 



(25b) 


»here 


d = - X 2 ) 2 + (E, - Y 2 ) 2 


(26) 


is the distance between the points P x and P 2 in the observation plane JL 

Apart from a trivial proportionality constant the expression on the right-hand-side of 
Eq. (24) will be recognized as the Airy formula for the field distribution in the Fraunhofer 
diffraction pattern of a uniformly coherently illuminated circular aperture [B&W. Section 8.5, 
Eq. (13)]. Its modulus is plotted in Fig. 3.8. It decreases steadily from the value unity when 
v = 0 to the value zero when v = 3.83 (indicated by the point B in the figure). Hence, with 
increasing separation of the two points P\ and P 2 the equal-time degree of coherence decreases 
from the value unity (complete coherence) to zero (complete incoherence) when the right- 
nand side of Eq. (25b) is equal to 3.83, i.e. when 


d = 


3.83 


0.617?A 


(27) 


A further increase in the separation of the two points reintroduces a small amount of coher¬ 
ence but the absolute value of the degree of coherence is then smaller than 0.14 and there 
is further incoherence, indicated by point C in Fig. 3.8 when v = 7.02. 



\pPi-Pi)\ 


Fig. 3.8 The absolute value of the equal-time degree of coherence j(P\, Pi) = 
e r *[2J l (kad/R)l(kad/R)], [Eqs. (24) and (25b)], at points P, and P 2 in the far zone, gen¬ 
erated by a uniform, incoherent, quasi-monochromatic circular source of radius a. 
















46 


Second-order coherence in the space-time domain 


The function 2Jfv)lv decreases steadily from the value unity when v = 0 to the value 
0.88 when v = 1 (indicated by points in the figure), i.e. for separation 


d = 


0.167? A 
a 


(28) 


In practice, a drop from the value unity (complete coherence) which does not exceed 12% 
is often not regarded as being very significant. Hence, roughly speaking, in the far zone and 
close to the direction normal to the source plane and parallel to it, the light produced by a spa¬ 
tially incoherent, quasi-monochromatic uniform, circular source of radius a is approximately 
coherent over a circular area AA, whose diameter is 0.16,Va, where a = a/R is the angu¬ 
lar radius subtended by the source when viewed from A/A. We note that A A = 
7r[0.16A/(2a)] 2 , i.e. 

D2 _-y 

A4 = 0.063 — A", (29) 

5 


where S = tuf is the area of the source. This expression is in agreement with the order-of- 
magnitude relation (2) of Section 1.3 for the coherence area. 

The far-zone behavior of the equal-time degree of coherence of light from an incoherent 
uniform, circular source which we just discussed was verified experimentally many years 
ago. 1 The results, together with the theoretical predictions, are shown in Fig. 3.9, for vari¬ 
ous separations of the pinholes. The experiment was repeated more recently, using a high- 
precision digital technique. 2 Excellent agreement with the theoretical predictions was 
obtained. 


3.3 Illustrative examples 

Two old classic interferometric techniques which we will now briefly discuss, both due to 
Albert Michelson, provide very good examples of some of the concepts and results of ele¬ 
mentary coherence theory which we just discussed, even though they were invented before 
the notion of coherence came to be understood. 


3.3.1 Michelson’s method for measuring stellar diameters 

Because the angular diameters that stars subtend at the surface of the Earth are exceedingly 
small, they cannot be measured directly even with the largest available telescopes. 
A. A. Michelson showed theoretically in 1890 and he. together with F. G. Pease, demon¬ 
strated experimentally in the 1920s that the angular diameter of a star and, in principle, also 
the intensity across the stellar disk may be obtained with the help of an interferometer as 


1 B. J. Thompson and E. Wolf, J. Opt. Soc. Amer. 47 (1957), 895-902. 

2 G. Ambrosini. G. Sehirripa Spagnola, D. Paoletti and S. Vicalvi. Pure Appl. Opt. 7 (1989), 933-939. 
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Fig. 3.9 Young's interference pattern formed by partially coherent quasi-monochro- 
matic light with three different degrees of coherence, from a uniform incoherent circu¬ 
lar source, for various separations of the pinholes. Upper figures: observed patterns. 
Lower figures: results of experiments. For details of the experimental arrangement see 
B&W Section 10.4.3 or the original paper by B. J. Thompson and E. Wolf, J. Opr. Soc. 
Amer. 47 (1957). 895-902, from which these figures are reproduced. 


shown schematically in Fig. 3.10. The principle of the technique may be understood as fol¬ 
lows. Light from the star is incident on the outer mirrors M x and Af? of the interferometer, 
is then reflected at two inner mirrors Mj and M 4 and is brought to the back focal plane J of 
a telescope to which the interferometer is attached. The purpose of the telescope is to pro¬ 
vide stability for the interferometer. The inner mirrors AT. and M 4 are fixed while the outer 
mirrors M\ and M% can be separated symmetrically in the direction joining M 3 and A/ 4 . 
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M 



Fig. 3.10 A schematic diagram of the 20-foot Miehelson stellar interferometer at Mount 
Wilson Observatory, mounted on the 100-inch reflecting telescope. [Adapted from 
F. G. Pease, Ergeb. Ex. Naturwiss. 10 (1931). 84—96.] 


In the focal plane '/ of the telescope one observes a diffraction image of the star, crossed by 
fringes formed by the two interfering beams. 

The visibility of the interference fringes in the focal plane ‘J depends on the separation 
d between the outer mirrors M , and Mn. Miehelson showed, by an elementary argument, 
that from measurements of the changes in the visibility with the separation of the two outer 
mirrors one may obtain information about the intensity distribution across the star, at least 
in cases when the star is assumed to be rotationally symmetric. Miehelson showed, in par¬ 
ticular, that, if the stellar disk is circular and uniform, the visibility curve, considered as a 
function of the separation of the two outer mirrors M\ and AT, will have zeros for certain 
separation distances d; and that for a spectral component of wavelength \ 0 the smallest of 
the values for which a zero occurs is 


d 


o 


O.blAo 

a 


(1) 


where a is the angular radius of the star. Thus from measurement of d 0 the angular diame¬ 
ter of the star may be determined. 

From the standpoint of coherence theory, the principle of the method can easily be 
understood. At the two outer mirrors A/, and AT, the incident light is, in general, partially 
















u j n 
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oherent. According to the far zone of the van Cittert-Zemike theorem [Eq. (20) of Section 
.2; see also Fig. 3.6], expressed in a slightly different form, the equal-time degree of 
coherence is given by the formula (with |t/>] 1, k 0 = <u (1 /c = 2-/A 0 ) 


j(M v M 2 ) 


jj i(u, v)e ifc oKTr- l i>“ +( >:-- ¥ i 1*1 dw dv 



i(u , v)d» dv 


( 2 ) 


Here i(u, v) represents the (averaged) intensity distribution across the stellar disk as a func¬ 
tion of the angular variables u and v of the points (£, if) on that disk. 


u 




(3) 


and (jq, yq) and (.tr 2 , y 2 ) are the coordinates of the outer mirrors A/| and M 2 . 

It follows at once from the formula (2) and the fact that the visibility of the fringes is 
equal to the absolute value of the equal-time complex degree of coherence [Eqs. (23) and 
(19) of Section 3.1] that the visibility of the fringes in the focal plane of the telescope is 
proportional to the Fourier transform of the (averaged) intensity distribution across the star. 
In particular, if the stellar disk is circular, is of uniform intensity and subtends angular 
radius a = a!R at the telescope, the formula (2) gives [cf. Eq. (24) of Section 3.2] 


I j(M r M 2 )\ 


2 ( k Q ad) 
k () ad 


(4) 


where d — ^(x 2 — .tq ) 2 4 (y 2 — >q ) 2 is the distance between the two outer mirrors M, 
and M 2 of the interferometer, J ] being the Bessel function of the first kind and first order. 
The smallest value d 0 of d for which the expression (4) vanishes is given by k 0 ad 0 = 3.83, 
implying that d i} = 0.61 A () /a, in agreement with Michelson's result expressed by Eq. (1). 

The first determination of a stellar diameter using this technique was made in the 1920s 
and employed a 20-foot interferometer attached to the 100-inch telescope at the Mount 
Wilson Observatory (Fig. 3.11). By means of it the angular diameter of the red giant star 
Betelgeuse in the constellation of Orion was determined by F. G. Pease. In these measure¬ 
ments the first “incoherence” occurred when the outer mirrors M\ and M 2 of the interfer¬ 
ometer were separated by the distance of d 0 = 307 cm. According to the formula (1) this 
implies, taking A 0 = 5.75 X 10~’em, that the angular diameter of Betelgeuse is about 
0.047 seconds of arc. Angular diameters of five other stars were determined by that 
interferometer in the 1920s, but the instrument has not been used since that time. However, 
it has been preserved at the Observatory. A photograph of it, taken in more recent times, is 
shown in Fig. 3.12. 









Fig. 3.11 The Michelson stellar interferometer mounted on the 100-inch telescope. 
[Adapted from F. G. Pease, Ergeb. Ex. Nciturwiss 10 (1931). 84-96.] 



Fig. 3.12 The original Michelson stellar interferometer preserved at the Mount Wilson 
Observatory, around the year 2000. (Courtesy of Gale Gant and Don Nicholson.) 
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Fig. 3.13 The Westbrook aperture-synthesis telescope. (Reproduced from K. Rohlfs, 
Tools of Radio Astronomy, Springer. Berlin and New York. 1986. p. 113, Fig. 6.9.) 


Since the time when the first Michelson stellar interferometer was built in the 1920s, sev¬ 
eral others have been constructed and used, one of them operating at infrared wavelengths. 
Today, however, this technique is used mainly in radio astronomy, where the principle has 
been applied with great success to map the radio sky. Because the wavelengths of radio 
waves are very- much longer than those of light, the base line of the radio interferometer - 
usually called a radio telescope or an antenna-synthesis telescope - has to be many orders 
of magnitude longer. Instead of mirrors one uses large antennas (see Fig. 3.13) and the 
incoming radio waves are sampled at pairs of them. The antennas are arranged in various 
configurations, one of which is shown in Fig. 3.14. 


3.3.2 Michelson’s method for determining energy distribution in spectral lines 

Suppose that a beam of quasi-monochromatic light is divided into tw o beams in a Michelson 
interferometer (Fig, 1.2) and that the beams are superposed after a path difference ct has 
been introduced between them. In the region of superposition interference fringes whose 
visibility depends on the path difference are formed. Michelson showed in the 1890s that 
from measurements of the visibility 1'(t) of the interference fringes, as a function of r one 
may obtain information about the energy distribution in the spectnim of the light 
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Fig. 3.14 The VLA (Very Large Array) at Socorro, New Mexico. The array consists of 
movable 25-m telescopes and operates with wavelengths ~3em. (Reproduced from 
K. Rohlfs. Tools of Radio Astronomy, Springer. Berlin and New York. 1986, p. 113, 
Fig. 6.9.) 


From the standpoint of coherence theory the principle of this method may be understood 
as follows. Let us for simplicity assume that the two beams have the same intensity. Then, 
according to Eq. (19) of Section 3.1. the visibility of the interference fringes, (considered 
now as a function of r rather than P), in the region of superposition of the two beams is 
given by the expression 

T'(r) = |7(r)|, (1) 

where 7 (r) = y(Qo, Qp, t), Q 0 denoting a typical point on the dividing mirror M 0 . According 
to the normalized form of the Wiener-Khintchine theorem [Eq. (9b) of Section 2.5], 

7(r) = f s(cu)e~ lwr dot, (2) 

J 0 

where s(w) is the normalized spectral density of the light at Q 0 . The integral on the right 
contains only positive frequency components, because we use the analytic signal represen¬ 
tation (which we discussed in Section 2.3) of the optical field. 

It is convenient to set 


7(t) = 7(r)e 


( 3 ) 
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jbere o) 0 is a mid-frequency of the light. It then follows from Eqs. (3) and (2) that 


where 


7(t) = J s(yu )e '>' T dp. 


•?(h) 


| s(cl> 0 + p) when p > —to 0 , 
0 when p < — a > 0 , 


m 


(5) 


From Eqs. (l)-(5) it follows that 


V(t) 


f s(p)e iftr d ft 

J —oo 


( 6 ) 


Suppose first that the spectrum is symmetric about the central frequency a> 0 . Then the 
“shifted" spectrum s(ps) will be approximately an even function of p and, consequently, the 
integral appearing in Eq. ( 6 ) will be real. It follows that in this case 

•V(t) = ±2 f j(p)cos(pr)dp. (7) 

J 0 

The ambiguity in the choice of the sign on the right-hand side of Eq. ( 6 ) arises from the fact 
that the expression ( 6 ) gives only the modulus of the integral. On taking the inverse of 
Eq. (7) we obtain for the shifted normalized spectrum the expression 

A If** 

s(ji) = s(o) fl + fi) = ±— | T ; (r)cos(/tr)dr. ( 8 ) 

7T J 0 

This formula shows that, when the spectrum is symmetric, the spectral energy distribution 
about the mid-frequency co Q may be calculated from measurements of the visibility curve, 
provided that the ambiguity in sign of the integral can be removed. This may usually be 
done by appealing to physical plausibility. 

If the spectrum is not symmetric, the Fourier transform of the “shifted" spectral density 
i(p) is no longer everywhere real and in such cases Eq. (8) no longer applies. In order to 
determine the spectral density in such cases one needs to know, in addition to the visibility 
curve, also the phase of the Fourier transform 7 ( 7 -) of s' f // ) or. alternatively, the phase of the 
complex degree of coherence. As mentioned earlier Lin the paragraph which follows 
Eq. (19) of Section 3.1], the phase of the complex degree of coherence can be determined 
from measurements of the location of the intensity maxima in the fringe pattern formed by 
the interfering beams. In Fig. 3.15 a result from Michelson's original determination by this 
technique of the energy distribution in spectral lines of thalium is reproduced. 





54 


Second-order coherence in the space-time domain 



0 0.5 


0 20 40 60 80 100 120 140 160 180 200 220 240mm 


Fig. 3.15 Two lines, each having a satellite, in the spectrum of thallium (left) and the 
corresponding visibility curve (right). (Adapted from A. A. Michelson, Light Waves 
and Their Uses, The University of Chicago Press, Chicago, IL. 1902, Reproduced from 
first Phoenix Science Series, University of Chicago Press, 1961, Fig. 64, p. 79.) 


In more recent times Michelson's method has been superseded by a related interferomet¬ 
ric technique, called Fourier spectroscopy, which >s also known as the interferogram 
method, and sometimes called the FTIR technique. It is used mainly in the infrared region 
of the spectrum. This method allows direct determination both of the real part and of the 
imaginary part of the degree of coherence '/(t). from which the normalized spectral density 
may be unambiguous!) determined. 


3.4 Propagation of the mutual intensity 


The van Cittert-Zemike theorem that we discussed in Section 3.2 implies that the spatial 
coherence of light changes on propagation. Specifically the theorem indicates that even a 
spatially incoherent source generates a field that is partially coherent and is, in some region 
of space, highly coherent. We will now generalize this result to propagation from an open 
surface on which the equal-time degree of coherence is known. 

Suppose that the complex amplitude at a typical point Q on a surface A intercepting a 
quasi-monochromatic light beam is V(Q. t) = t/(0exp( —iFit). Then, according to the 
Huygens-Fresnel principle (B&W, Section 8.2) the space-dependent part of the complex 
amplitude at a point P j iri the domain in which the beam propagates is given by the expres¬ 
sion (assuming small angles of incidence and diffraction) 



( 1 ) 


where R denotes the distance from Q, to P 1 (see Fig. 3.16). Similarly, 



(2) 
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Fig. 3.16 Notation relating to derivation of Zemike’s propagation law for the mutual 
intensity [Eq. (5) of Section 3.4], 


k = (ijIc being the mean wave number associated with the mean frequency a>. It follows 
that 

0*(P t )U{P 2 )) = ^7 / ( U*U2 l W(<2 2 ))———dQ l dQ 7 . (3 ) 

A" jq tj 

The expectation values appearing in this formula will be recognized as the mutual intensity at 
the points Q ] and Q 2 [cf. Eq. (20) of Section 3.1] and hence the formula (3) implies that 

J(P X , P 2 ) — -jj J ^ J(Q v Q 2 ’ &Q\ dQ 2 - (4) 


This formula was first derived by F. Zemike in his basic paper on coherence cited earlier, 
published in 1938, and is often called Zemike’s propagation law for the mutual intensity. 

If we recall the definition of the equal-time degree of coherence j{Q\, Q 2 ) [Eq. (21c) of 
Section 3.1] we may express Zernike’s propagation law (4) in the form 


jiP r P 2 ) 


Jnp^JnK) J* J* 


n , - - - Qik(R 2 /? t ) 

j(Q v Q 1 )yll(Q 1 )yll{Q 2 ) dQ l d Q 


1 ^ 2 ‘ 


(5) 












56 


Second-order coherence in the space-time domain 


This formula expresses the equal-time degree of coherence of the light at any pair of points 
P i and P 2 in the region of space into which the light propagates in terms of the equal-time 
degree of coherence and the averaged intensity of the light across the surface -1 The for¬ 
mula (5) is a generalization of the van Cittert-Zemike theorem and reduces to it when the 
light distribution on the (now planar) surface A is spatially incoherent [/((?], Q 2 ) ~ 
6 <2 \Q 2 — where 6 i2> is the two-dimensional Dirac delta function]. 

The formulas which we just derived apply to propagation in free space. If the space between 
the surface % and the points P\ and P 2 contains a linear medium or a linear optical system 
one can readily generalize the formulas to such situations. It is necessary merely to replace 
the “propagator” exp(iA'R)//? (R — QP) by an appropriate transmission function, say K(P, 
Q). Instead of Eq. (4) one then has 



( 6 ) 


This formula may be used to determine, for example, the equal-time degree of coherence 
in the image plane of a light source, a situation discussed in detail by Zernike in his classic 
paper. 

In this section we have considered only propagation of the mutual intensity J(P\, P 2 )- 
One can also formulate laws for the propagation of the more general mutual coherence 
function TIP], P 2 , r) which are somewhat more complicated. They may be derived from 
rigorous propagation laws for the mutual coherence function, to which we will now turn 
our attention. 

3.5 Wave equations for the propagation of mutual coherence in free space 

We saw in Section 3.2 that the mathematical formulas which express the van Cittert-Zemike 
theorem for the mutual intensity and the equal-time degree of coherence closely resemble 
a well-known formula of elementary' diffraction theory , namely the Huygens-Fresnel prin¬ 
ciple. The propagation law for the mutual intensity which w e just derived likewise resem¬ 
bles the Huygens-Fresnel principle. We will now show that there is a deeper reason for these 
analogies than might appear to be the case from our formal derivation. 

Consider an ensemble {Fir. f)} representing a complex wavetield in free space. 1 Each 
member of the ensemble satisfies the wave equation 



( 1 ) 


1 In the preceding sections we frequently denoted points by capital letters, such as P, Q, and S. In the general the¬ 
ory it is. however, more convenient to represent them by position vectors such as rj, r, etc. In the subsequent 
sections we will use either of the notations, whichever is more convenient. 
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Let us take the complex conjugate of this equation, replace r by iq and t by t and multiply 
the resulting equation by V(r 2 . t 2 ). This gives 




1 

c 2 dt 2 


U) 


V'l.r, 


( 2 ) 


where V? is the Laplacian operator acting with respect to the point r,. Next let us take the 
ensemble average of both sides of this equation and interchange the order of the various 
operations. We then obtain the equation 


V 1 2 (r(r 1 ,f i m.r 2 ,r 2 )) = -L (v* (r f )V(r 2 . r 2 )). 
c- otr 


(3) 


If the field is statistically stationary', at least in the wide sense as we now assume, then 

(V'*(r 1 ,/ 1 )V(i^,r 2 )} = {V*(r 1 ,r)V'(r 2 , t + t 2 -r,)) = Tfr,,r a ,r), (4) 


t = t 2 ~ t 1, where T(r,. r : , r) is the mutual coherence function of the field [Eq. (6) of 
Section 3.1]. Evidently d 2 ldt\ — d^-ld-r and hence Eq. (3) implies that 




In a stric tly similar manner we find that 


V* 2 nr 1 .r 1 .r)-Tf£M 

C~ OT- 


(5a) 


(5b) 


where Vt is the Laplacian operator acting with respect to the point r 2 . 

The two wave equations 1 (5a) and (5b) for the mutual coherence function hold rigor¬ 
ously for propagation in free space. From them one can obtain at once equations which are 
valid, to a good approximation, for the propagation of the mutual intensity in free space. 
They follow on substituting for T from Eq. (22) of Section 3.1 into the wave equations (5). 
One then obtains the two equations 

V 2 /(r,. r 2 ) + k 2 J(r v r 2 ) =0. (6a) 


and 


where 


V;/(r r r 2 ) + k 2 J( r,. r 2 ) ~ 0. 


k = 


(D 

C 


(6b) 


(7) 


1 Actually the two equations (5) are not independent, because of the relation Fir!, r 2 , t) = r*(r 2 . r,. -r). 
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provided that the inequality given by Eq. (24) of Section 3.1 holds, i.e. provided that 


|r| <S 


2;r 

Am 


( 8 ) 


Equations (6) explain why the van Cittert-Zemike theorem resembles closely the 
Huygens-Fresnel principle of elementary diffraction theory. This principle is a conse¬ 
quence of the fact that a monochromatic optical field satisfies the Helmholtz equation. We 
have just shown that the mutual intensity of a stationary random field also obeys, approxi¬ 
mately, the Helmholtz equation. This common property implies that in both cases the prop¬ 
agation is. to a good approximation, governed by the Huygens-Fresnel principle. 


PROBLEMS 

3.1 A plane, polychromatic light wave, whose spectrum consists of a line with Gaussian 
profile, is incident normally on a screen A containing two pinholes. At each instant of 
time the complex wave amplitudes at the two pinholes are the same. Derive an expres¬ 
sion for the visibility of the interference fringes observed at a point P on a screen B 
parallel to > 1 , at distances r, and r 2 from the pinholes. 

3.2 A double star consists of two components which subtend on Earth the same angular 
diameters 2o and have angular separation 20. The stars may be regarded as having 
uniform circular cross-sections and radiating at the same mean wavelength. The ratio 
of the brightnesses of the two components is 1 : b. The light receiv ed from the star is 
passed through a filter so as to make it quasi-monochromatic. 

(a) Derive an expression for the equal-time degree of coherence j I3 of the star light in 
the observing plane of a Michelson stellar interferometer. 

(b) If /3 » a, show how 0 may be determined from the visibility curve. 

3.3 A Michelson interferometer is illuminated by a quasi-monochromatic light beam hav¬ 
ing a rectangular spectral distribution of width Aw. centered on frequency w. The 
interference fringes first vanish w'hen one of the mirrors is displaced by a distance d () 
from its symmetric position with respect to the other mirror. Determine the degree of 
self-coherence "i(r, r, r) of the light at the beam splitter of the interferometer and using 
it, calculate the width Aw of the spectral distribution. 

3.4 A spectrum of light consists of N “lines' of the same profile but of different intensities, 
centered at frequencies w,, w 2 ,. . ., w v . The light is analyzed by means of a Michelson 
interferometer. 

(a) Derive an expression for the visibility curve. 

(b) Discuss in detail the case when the spectrum consists of two lines (N = 2) of the 
same intensities and with identical profiles that are of Gaussian form. Show also 
how the separation of the two lines may be determined if it is assumed to be large 
relative to the effective width of each line. 
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3.5 The mutual intensity function for all pairs of points on a surface intercepting a station¬ 
ary quasi-monochromatic light beam is of the form 

A r i> r 2 ) = f(r ] )g(r 2 ). 

where/(r) and g(r) are known functions of position on the surface. Show that 

(a) g(r) = af (r), where a is a real constant and the asterisk denotes the complex con¬ 
jugate; and 

(b) the light in the space into which it propagates is completely spatially coherent 
within the framework of second-order coherence theory. 


3.6 Consider a real random source-distribution Q (n (r, r), localized for all times within a 
finite volume D in free space, and let V (r) (r, t) be the field which the source generates. 
Q [x) and V TI are related by the inhomogeneous wave equation 

1 d 2 V {I) {r t) 

V 2 V <r) (r. t) - = -47r£ ,r) (r, t). 

c 2 dt~ 

Show that if Q (t> ( r, r) and V /<r) (r, t) are stationary random processes, and Q( r, t ) and 
Vfr, t) are the associated analytic signals, then the cross-correlation functions 

r e ( r ,, r 2 , r) = {Q*(r v t)Q(r z , t + r)) 


and 


r vO r i*r 2 ,r) = (V*(r,, t)V(r 2 . t + r)) 


are related by the equation 


rr: 1 

1 ^ 

2 c 2 dr 2 

Vl c 2 dr 2 


r v (r,. r : , T) = (4—)-(r,, r,. r). 


3.7 The mutual coherence function of a certain stationary' optical field in free space has 
the form 

T(r,, r 2 , r) = F(r,. r 2 )C(r). 

Show that the function F(r t , r 2 ) must satisfy two Helmholtz equations and determine 
the most general form of G(t). 
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Second-order coherence phenomena in 
the space-frequency domain 


Up to relatively recent times, second-order coherence phenomena have usually been 
described in terms of the space-time correlation function, namely the mutual coherence 
function l'(r ,, r 2 , r), or in terms of a space correlation function, i.e. the mutual intensity 
/(i*i, r 2 ). Using them we defined, in Section 3.1, the complex degrees of coherence ^(r j, r 2 , 
t) andj(ri, r 2 ). More recently an alternative description was developed, which has consid 
erable advantages in the analysis of many problems involving statistical wavefields. It 
employs certain functions which were originally introduced rather formally in terms of the 
Fourier transforms of the mutual coherence function, but later they were found to be also 
correlation functions, associated with ensembles of realizations, that are functions of posi¬ 
tion and frequency, rather than of position and time. This step is not as trivial as it might 
appear at first sight because, as we noted in Section 2.5, the sample functions of a station¬ 
ary random process do not have a Fourier frequency representation. 

The newer space-frequency representation turned out to be very useful for providing 
solutions to many problems and it has led to the disco\er\ of some new effects, some of 
which we will discuss in this chapter. 

4.1 Coherent-mode representation and the cross-spectrai density 
as a correlation function 

As we already noted, in the space-time formulation of coherence theory of stationary opti¬ 
cal fields, the basic quantity is the mutual coherence function 


r(r,,r 3 ,r) = (V*(r, J f)F(r-,,f + r)). 


(1) 


In the space-frequency formulation, the basic quantity is the cross-spectral density func¬ 
tion W(r,, r 2 , «), which is its Fourier frequency transform, i.e. 



(2) 
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For later purposes we note that in free space the cross-spectral density obeys the two 
Helmholtz equations 

ViMr,, r 2 , <u) + IbWAr,, r 2 , «) = 0 (3a) 

and 

V 2 Wji, r 2 , oj) + k 2 W(r h r 2 , w) = 0, (3b) 

where Vj and V 2 are the Laplacian operators acting with respect to the points r, and r 2 . 
respectively, k = a>/c is the free-space wave number, c being the speed of light in vacuum. 
These equations follow at once from the two wave equations (5a) and (5b) of Section 3.5 
w hich are satisfied by the mutual coherence function in free space, on taking their Fourier 
transforms. 

We have already encountered the cross-spectral density in a somewhat broader context, in 
connection with the generalized Wiener-Khintchine theorem [Eqs. (12a) and (12b) of 
Section 2.5] of the theory of stationary random processes. However, in that treatment the 
cross-spectral density appeared in a “singular formula” containing the Dirac delta function 
[Eq. (11) of Section 2.5]. In the space-frequency formulation of coherence theory, the cross- 
spectral density is introduced in an alternative way, within the framework of ordinary func¬ 
tion theory, as a correlation function of a statistical ensemble of well-behaved realizations. 

Let us consider an optical field in a closed domain D in free space. Then it can be shown 
(M&W, Sections 4.7.1 and 4.7.2) that, under very general conditions (Hermiticity, non¬ 
negative definiteness and square-integrability of W over D), the cross-spectral density of 
the field at any pair of points r, and r 2 in D may be expressed in a (generally infinite) series 

W (r,, r,, cu) = ^A 9 (w)^( r,, cu)^(r 2 t m), ( 4 ) 

n 

The functions <j> n may be shown to be the eigenfunctions and A„ the eigenvalues of the inte¬ 
gral equation 

f D W(r v r 2 ,aj)0 n (r v u))d3r l = X n (co)o n (r 2 .w). (5) 

The eigenfunction (j) n may be taken to form an orthonormai set over the domain D, i.e. 

f D € (r -°'K, (r, w)d 3 r = 8 nm , ( 6 ) 

6 nm being the Kronecker symbol ( 8 nm = 1 when n = m, 8 nm = 0 when n + m). The quanti¬ 
ties A„(cu) [the eigenvalues of the integral equation (5)] are positive, i.e. 

A„(<w) > 0 (n 2*0). (7) 

The summation in Eq. (4) must be interpreted as follows. If D is a three-dimensional 
domain, n stands for the triplet (« b m, n 3 ) of non-negative integers and E stands for a triple 
sum. If the domain is two-dimensional n stands for a pair of non-negative integers, « , and 
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n 2 , and E stands for a double sum. If the domain is one-dimensional, n stands for a non¬ 
negative integer and one has a single summation. 

In free space, each function o„(r. at) obeys the Helmholtz equation 

V 2 ^„(r, co) + tc<p n ( r. at) = 0. (8) 

To derive this result we substitute the expansion (4) of the cross-spectral density into the 
Helmholtz equation (3b), multiply the resulting equation by <fr m (r h w), integrate both sides 
with respect to iq over the domain D and use the orthonormality relation (6). For reasons 
which will become evident later, the expansion (4) is known as the coherent-mode repre¬ 
sentation of the cross-spectral density. 

We will now show that by using the expansion (4) of the cross-spectral density, one can 
construct an ensemble {U(r, co)} of sample functions U(r. co) in terms of which the cross- 
spectral density of the held in the domain D may be expressed as a correlation function. 

Let us consider the ensemble of sample functions of the form 

U(r.co) = Va n (oj)^(r,aj), (9) 

n 

where the a„(w) are random coefficients such that 

«( 10 ) 

and the A„(o>) are the same positive quantities as appear in the expansion (4), 1 i.e. the eigen¬ 
values of the integral equation (5). 

Next we consider the correlation function {E/’(iq. a>)£/(r 2 , co)). One has, on using the 
expansion (9), 

(t/*(r lt &))£/(%, iu))^ = V;^( cl *(a,x7 m ( W ))c>*(r 1 ,«)d )ffl (r 2 ,m), (11) 

n m 

where we have interchanged the order of the ensemble average and the double summation. 
On using (10), the expansion (11) simplifies: 

p*(r r a))U(r 2 ,co)) w = V Ajaj)o*(r r w)<p n (r,,w). (12) 

n 

Since the right-hand sides of Eqs. (12) and (4) are the same, the left-hand sides must be 
equal to each other and hence we have established the important result that 

VHr,, m, co) = cu)[/(rq>, co)) w . (13) 


1 There are many ways of choosing such random coefficients. For example one can take a n ( oj) = ^A n (<u)e lfl ", 

where, for each n, 9 n is a real random variable that is uniformly distributed in the range 0 =S 9„ < 2ir and 9„ and 
9 m are statistically independent when n m. With this choice the requirement (10) is satisfied. 
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We have attached the suffix to on the angular brackets to stress that the average is taken over 
an ensemble of space-frequency realizations. It is an ensemble w hich is quite different from 
the ensemble taken over the space-time realizations V{ r, /). which we encountered earlier. 

The formula (13) is an important result. It shows that the cross-spectral density of a sta¬ 
tistically stationary fluctuating field in the domain D may be expressed, for all pairs of 
points in D, as a cross-correlation function of an ensemble {U( r, «)} of space-frequency 
realizations U(r at). 

Because each of the functions <j>„( r, to) satisfies the Helmholtz equation ( 8 ) it is clear that 
the left-hand side of Eq. (9) also satisfies that equation and. consequently. 


V 2 t/(r, to) + k 2 U( r, at) = 0. 


(14) 


Hence we may regard each sample function U( r, w) of our ensemble as being the space- 
dependent part of a monochromatic wavetield V(r, t) = U (r culexpt -\tot). This fact 
contributes towards an intuitive understanding of many of the results of the second-order 
coherence theory in the space-frequency domain. For example. Eq. (13) implies that the 
spectral density 5(r, to) - W(r. r, to) of the fluctuating field '/( r, t) at a point r may be 
expressed in the form 


S(r, to) = (if (r, co)U( r, w)) a . 


(15) 


This formula is similar to the formula based on the common intuitive belief that the spec¬ 
tral density is the average of the squared modulus of the Fourier frequency components of 
the fluctuating field V(r, /). However, as we learned in Section 2.5, a stationary random 
field F(r, t) does not have a Fourier frequency spectrum. Nevertheless the formula (15) is 
rigorously valid, but one must appreciate that U{ r. w) is not a Fourier frequency component 
(which does not exist) of the fluctuating field but is the space-dependent part of a member 
of the statistical ensemble {V(r. t) = Ui r. w)e _1 “ f } of monochromatic realizations, all of 
frequency <u. The distinction between a monochromatic field and an ensemble of mono¬ 
chromatic fields of the same frequency is crucial. Once this fact is appreciated, one can use 
the space-frequency representation with great advantage to study second-order coherence 
phenomena in stationary wavefields, as we will soon see. 


4.2 The spectral interference law and the spectral degree of coherence 

In Section 3.1 we introduced a (generally complex) space-time correlation coefficient, 
namely the degree of coherence 7 ( 1 * 1 , r 2 , r), from the analysis of Young's interference 
experiment. In this section we will introduce a space-frequency correlation coefficient, 
also from the analysis of Young's interference experiment, but with the difference that 
instead of considering the distribution of the intensity in the detection planeTve will con¬ 
sider the spectrum of the light in that plane. For this purpose it is not necessary that the light 
illuminating the pinholes is narrow-band: on the contrary, the effect of superposing two 
beams emerging from the pinholes on the spectrum of the light in the detection plane 
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P 


Fig. 4.1 Notation relating to Young's interference experiment with broad-band light. 


becomes more pronounced when the light incident on the pinhole contains a broad band of 
frequencies, as we will soon see. 

Let us once again consider light of any state of coherence, incident from the left on an 
opaque screen ^containing two pinholes, at points 0. and Q 2 (Fig. 4.1). As is clear from 
the results that we have just established, we may represent the field at the pinholes by 
ensembles of frequency-dependent realizations {U(Q u co)} and { U(Q 2 . &>)}. We assume 
that the pinholes are sufficiently small that the amplitude of the field is effectively constant 
over each of them and also that the angles of incidence and of diffraction are small. Then 
the field at a point P on a detection plane $ some distance beyond the screen M and paral¬ 
lel to it is, to a good approximation, given by an ensemble of realizations [U(P, qj) } where 


U(P.co) = K { U(Q r co )e' kR ' + K 2 HQ z .co)e ' kR -, 


( 1 ) 


K t and K 2 being defined by Eq. (3) of Section 3.1, with A being replaced by the wavelength 
A corresponding to the frequency co. i.e. A = 2i\cko and R, and R 2 are, as before, the dis¬ 
tances from £>, to P and from Q 2 to P, respectively. 

Let us substitute from Eq. (1) into the expression for the spectral density [Eq. (15) of 
Section 4.1]. If we also use the fact that W(Q 2 , Q |. co) = W r (Qi, Q 2 , to), we obtain for the 
spectra] density at the point P the expression 


S(P, to) = IffjpStGu to) + \K 2 \ 2 S(Q 2 , to) + 2 %e{K*,K 2 W(Q h Q 2 , co) e -% (2) 


where 


6 = —(R, - R 2 ). 
A 


(3) 


The factors K\ and K 2 are proportional to the areas of the pinholes. If we let the area of the 
pinhole at Q 2 decrease to zero, Eq. (2) will then represent the spectral density, S' l \P, co) 
say, at the point P. with just the pinhole at Q t open. i.e. 


|Ar 1 p5(j3 1 , co) = S'^iP. co). 


(4a) 
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Similarly, 


\K 2 \ 2 S(Q 2 , to) a S ( - 2 KP, to). (4b) 

represents the spectral density at P with just the pinhole at Q 2 open. Hence the formula (2) 
may be rewritten in the physically more significant form 

S(P,co) = S {1 HP,oj) + SV\P,(o) 

+ 2yS<'HP,tu)JsVKP,a) . (5) 


where 


p(Q v Q 2 .a>) 


W(Q ] .Q 1 ,co) 

y]w {Q l ,Q 1 ,a>)yjw(Q 2 ,Q 2 ,co) 
W(Q v Q 2 ,to) 
^]S(Q ] .co)^jS(Q v co) ' 


( 6 a) 

(6b) 


If we set 

)KQ v Q v co) = \fi(Q v Q 2 .co)\^^ 

the expression (5) for the spectral density at the point P in the observation plane becomes 
S(P,co) = S ( 1 >(P,w) + S ( 2 ) (P,«) 

+ 2 yjs® (F, to)yjs (2 > (P. to) 1#*^, Qy ,calicos^ .Q 2 , to) - 5J. (8) 

This formula is called the spectral interference law for the superposition of beams of any 
state of coherence. In its mathematical structure it is of the same form as the “intensity inter¬ 
ference law” [Eq. (14) of Section 3.1J; but its meaning is different. The intensity interference 
law' is an expression for the averaged intensity at a point P in the interference pattern, 
whereas the formula ( 8 ) is an expression for the spectral density of the light at that point. 

Before turning our attention to some consequences of the spectral interference law, we 
will briefly comment on the physical significance of the factor //(Q,, Q 2 , a>) which plays an 
analogous role to the complex degree of coherence 7(<2 i, Q 2 - T )- This is evident on compar¬ 
ing Eq. ( 8 ) w'ith Eq. (14) of Section 3.1. The expressions ( 6 ) by which p(Q\, Q 2 , u>) has been 
introduced show that it is the normalized cross-spectral density of the fluctuating field at the 
points Qi and Q 2 . We have already learned that the cross-spectral density Wi r,. r 2 , to) may 
be interpreted as a correlation function in the space-frequency domain [Eq. (12) of Section 
4.1] By using the fact that the cross-spectral density function is non-negative definite (or by 
use of the Schwarz inequality) one may show [see B&W, Appendix VIII. p. 911] that its nor¬ 
malized factor plQi, Q 2 , <o) is bounded in absolute value by zero and unity, i.e. that 


0^\p.(Q h Q 2 ,w)\ S3 1 


(9) 
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for all values of its argument. The extreme value |p| = 1 represents complete correlation; 
the other extreme value, p = 0, represents absence of correlation. For this reason the nor¬ 
malized cross-spectral density /i(0 ]t 0 2 > w > is called the spectral degree of coherence at 
frequency co of the light at the points 0 , and 02 - 

Let us now return to the spectral interference law (8). Usually S^ 2, (P, at) ~ S {X \P, cn) and 
the spectral interference law then takes the simpler form 

S(P , co) = 2 S (U (P, o>){l + | p(Q h Q 2 . «h)|cos [/5(0„ Q 2 , co) - 6]}. (10) 

We note two implications of this formula. First, at any frequency co, the spectral density 
varies sinusoidally with the position of the point P across the detection plane % with the 
amplitude and phase depending on the spectral degree of coherence. Secondly, at any fixed 
point P the spectral density S(P, co) will, in general, differ from the spectral density oj) 

of the light which would reach the point P through only one of the two pinholes, the differ¬ 
ence depending on the spectral degree of coherence p.(Qi, Q 2 , co). This difference in the two 
spectra is an example of the phenomenon of correlation-induced spectral changes, which 
we will discuss shortly. 

In a sense the “intensity” interference law given by Eq. (8) of Section 3.1 and the spectral 
interference law (8) are complementary to each other. The former shows that appreciable mod¬ 
ifications of the averaged intensity take place when narrow-band quasi-monochromatic light 
beams are superposed. The latter indicates that appreciable changes of spectra may take place 
when two broad-band beams are superposed. More detailed analysis reveals that in the former 
case no appreciable spectral changes take place, whereas in the latter case no appreciable inten¬ 
sity variations occur. Moreover, no interference fringes are formed when the path difference 
intioduced between the two beams exceeds distances of the order of the coherence length of the 
light, whereas spectral modulation takes place irrespecth e of the phase difference 1 6, defined 
by Eq. (3), as is evident from the spectral interference law (8). Figure 4.2(a) shows the exper¬ 
imental set-up for illustrating this effect. Results of the experiment are shown in Fig. 4.2(b). 

Spectral changes have also been observed in star light passed through two slits in an 
opaque screen and then superposed. 2 From such changes, shown in Fig. 4.3, the spectral 
degree of coherence of the star light reaching the Earth was determined by the use of the 
spectral interference law (8). In principle, one can estimate from such measurements the 
angular diameter of the star with the help of the van Cittert-Zemike theorem (that we dis¬ 
cussed in Section 3.2). This method must be distinguished from Michelson’s method for 
measuring stellar diameters, which we described in Section 3.3.1. Michelson’s method is 
based on determining the correlation between the fluctuating fields at two mirrors of an 
interferometer from visibility measurements, whereas the method which we just mentioned 


1 Similar effects have been found in interference experiments with matter waves, specifically with neutron beams 
[see, for example H. Rauch, Phys. Lett. A173 (1992). 240-242 and D. L. Jacobson. S. A. Werner and H. Rauch, 
Phys, Rev. A49 (1994), 3196-3200. See also G. S. Agarwal. Found. Phys. 25 (1995), 219-228. 

2 H. C. Kandpal, A. Wasan, J.C. Vaishya and E.S.R. Gopal Indian J. Pure Appl Phys. 36 (1998), 665-674. In 
this paper measurements of the frequency dependence of the spectral degree of coherence of light from a star 
(a-Bootis) were also reported. 



4.2 The spectral interference law 


67 




Fig. 4.2 Spectral changes generated on superposition of two partially coherent, broad¬ 
band light beams, emerging from two slits, (a) Layout of the experiment with 
= 0.68mm. D 2 = 3.4mm. a = 0.026mm. i> = 0.11mm. (b) Measured values, 
denoted by circles and interpolated by the solid curves and the original spectrum (bro¬ 
ken line). [After M. Santarsiero and F. Gori. Phys. Lett. A167 (1992), 123-128.] 

is based on measurements of the spectral changes arising on interference between two 
beams, i.e. it makes use of correlation-induced spectral changes. 

In discussing the “intensity interference law” in Section 3.1, we showed that the absolute 
value of the complex degree of coherence yf Q { , Q 2 , t) can be determined from measure¬ 
ments of the visibility of interference fringes [Eq. (19) of Section 3.1], We will now show 
that the absolute value of the spectral degree of coherence fi(Q u Q 2 , to) may be determined 
in a somewhat similar manner. For this purpose we note from the spectral interference law, 
specialized to the case when S {2) (P , to) = S a, (P , to) [Eq. (10)], that the spectral density 
S'( P. w) at any fixed frequency to has a maximum in the neighborhood of a point P in 
the plane of observation, when the path difference (/? — <5) is such that the cosine term in 
Eq. (10) has the value +1 and that it has a minimum when it has the value —1. These 
extreme values of the spectral density evidently are 

S WX {P, to) = 25 <] »(cu)[l + | )i(Q u Q 2 , tu)|J s (11a) 

S mm (P, to) = 25 (1, (tu)[l - | p(Q h Q 2 , oj)|]. 


(lib) 
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Fig. 4.3 Spectra of the stars a-Bootis (a) and ci-Scorpio (b) at an observation point P 
on axis. Curve S] is the recorded spectrum w hen slit P { is open and P 2 is closed. The 
curve S 2 is the recorded spectrum when the slit P 2 is open and P ( is closed. Curve S 
is the recorded spectrum when both P l and P 2 are open. The dotted line represents the 
sum of the two spectra given by the curves S! and S 2 . [After FI. C. Kandpal, A. Wasan, 
J. S. Vaishya and E. S. R. Gopal, Indian J. Pure Appl. Phys. 36 (1998), 665-674.] 


By analogy with the definition Eq. (18) of Section 3.1 of the fringe visibility, we now intro¬ 
duce the concept of spectral visibility ViP. of). defined by the expression 


V(P, w) 


GP,to) + S mm (P,to)' 


( 12 ) 


On substituting from Eqs. (11) into Eq. (12) it follows that 

V(P, to) = | p(Q u Q 2 , to)|. (13) 

This formula shows that the absolute value of the spectral degree of coherence can be 
obtained from Young’s interference experiment by letting the light from the two pinholes 
pass through narrow-band filters which transmit a narrow portion of the spectrum centered 
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on the selected frequency oj. The argument (phase) of /r can also be determined experimen¬ 
tally, from measurements of the locations of the maxima and the minima, in a similar man¬ 
ner (mentioned in Section 3.1 and described in B&W, Section 10.4.1) to what can be done 
to determine the phase of f, but, in addition, one must again use narrow-band filters. Other 
techniques for measuring both the modulus and the phase of the spectral degree of coher¬ 
ence have been described in several publications. 1 

With the interpretation of the normalized cross-spectral density (6) as the spectral degree 
of coherence, one can readily understand why the expansion Eq. (4) of Section 4.1 is 
known as the coherent-mode representation of the field. For this purpose let us rewrite the 
expansion in the form 

W (r,, r 2 , w) = (14) 

n 


where 


W <, ”(r l , r 2 , oj) = A„(«)<A>*(ri, oj)<p n {r 2 , to). 


(15) 


The spectral degree of coherence associated with the contribution of W ( ' ,) is given by the 
expression 


/i (n) (rj, rj.w) 


W (n, (r P r 2 ,w) 

\(co)4>*(r v u))0 n (r 2 ,(o) 

v/V w) k (r !’ w) l 2 V A « (w) l^ (r 2’ w) l 2 


(16) 


implying that 

| M W(r b r 2 , «)| = 1. (17) 

In going from the first to the second expression on the right of Eq. (16) we used the fact that 
the spectral density S {n) ( r, oj) = r. oj). 

Equation (17) shows that each term (mode) in the expansion Eq. (4) of Section 4.1 rep¬ 
resents a field which is completely spatially coherent at frequency oj. 

It may be shown (M&W. Section 7.4) that laser modes provide examples of such coher¬ 
ent modes. 


4.3 An illustrative example: spectral changes on interferences 

The spectral interference law that we have derived in the preceding section has a number of 
interesting implications and some useful applications. In this section we will show how it 
may be used to determine the angular separation of distant objects. 2 

1 D.F.V. James and E. Wolf, Opt. Common. 145 (1997), 1-4: S.S.K. Titus. A. Wasan, J.S. Vaishya and H. C. 
Kandpal, Opt. Common. 173 (2000), 45^19; V. N. Kumar and D. N. Rao, J. Mod. Opt. 48 (2001), 1455-1465: 
G. Popescu and A. Dogariu. Phys. Rev. Lett. 88 (2002), 183902 (4 pages). 

2 The analysis of this section is based on a paper by D. F. V. James, H. C. Kandpal and E. Wolf, Astrophvs. J. 445 
(1995), Part I, 406-410. 
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When the spectral densities of the light reaching the observation point P from each of the 
pinholes are the same [i.e. S (]) (P. w) = S a \P. w)] we have, according to the spectral inter¬ 
ference laws [Eq. (10) of Section 5.2], 


S(P,co) 

2 S a HP,o)) 


1 + |ju(Qi,Q 2 .«)}cos[^(2 11 (9 31 w) -(5]. 


(l) 


Let R i and R 2 denote, as before, the distances from each of the tw o pinholes at points Q, 
and Q 2 to the point of observation P. d the distance between the pinholes and x the distance 
of the point of observation from the axis (see Fig. 4.4). Suppose that x is small relative to 
the distance R between the plane of the pinholes and the observation plane, i.e. that 
x/R « 1. Then AS - R, ~ xd/R and the path difference 6 = k(R 2 — R t ) = ojxdHRc). 
Under these circumstances the formula (T) becomes 


S(P,co) 

2 S m (P,w) 


1 + , Q 2 . «>jcos[ J( Q r Q : .(i>) - wxdKRc)}. 


( 2 ) 


Suppose that the source is a pair of identical circular disks, e.g. a somewhat idealized dou¬ 
ble star, each of uniform intensity %. The intensity distribution / 0 (p) across this two-com¬ 
ponent source, each component of which is assumed to be spatially incoherent, may be 
expressed in the form 

I 0 (P) = 'o{circ[|p + b 0 /2|/«] + circ[[p - b 0 /2|/a]}, (3) 

where a denotes the radius of each of the two circular sources, b 0 is the vector specifying 
the location of the center of one of them relative to the other, i 0 is the intensity, assumed to 
be constant, of either of the two sources and 


circ(.v) = 


1 when 0 < ].v| < 1, 
0 when |.vj > 1. 


(4) 
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Fig. 4.5 Illustrating a method for determining the angular diameters and angular sepa¬ 
ration of two circular sources from spectroscopic measurements. 


The spectral degree of coherence of the light reaching the pinholes from the source can 
readily be calculated by substituting from Eq. (3) into the far zone form Eq. (20) of Section 
3.2, of the van Cittert-Zernike theorem with the equal-time degree of coherence j(Q\, CM 
replaced by the spectral degree of coherence /. i(Q\, Q 2 , w). One then finds that 


l i(Q r Q 2 .w) 


2 [aa)dt(cr)] 
awd/(cr) 


cos 


o>db 0 
2 cr 


(5) 


where r is the distance between the “source plane” and the plane of the pinholes (see Fig. 
4.5) and d is the (vectorial) distance between the two pinholes. For the sake of simplicity 
we have assumed that the line joining the pinholes is parallel to the line joining the centers 
of the two circular sources, i.e. that d is parallel to b n . 

On substituting from Eq. (5) into Eq. (2) it follows that 


S(P,w) 

2 S il HP.w) 


1 + 


2J t [aaxl/{cr)\ 
cioj/(cr ) 


cos 


<ud-b 
2 cr 


cos 


(oxd 
cR 


( 6 ) 


where we have used the fact that the factor containing the Bessel function is real and posi¬ 
tive and that R> d. 

Figure 4.6 shows the behavior of the ratio 2 S(P, w)/S il \P. w) as a function of the fre¬ 
quency oj, for some selected values of the parameters. Several features are worth noting: 
first, the rapid sinusoidal modulation, due to the factor cos[w.r<//(c7?)]; and secondly, the 
contrast of the spectral modulation due to the frequency dependence of the spectral degree 
of coherence, given by Eq. (5), of the light at the two pinholes. One can readily identify the 
two different causes of the modulation as being due to (i) the size of the source and (ii) the 
separation of the two sources. Thus, in principle, the angular radii of the sources and their 
angular separation can be deduced from such spectral measurements made at fixed pinhole 
separation. These theoretical predictions have been verified by experiments 1 (see Fig. 4.7). 

1 H. C. Kandpal, K. Saxena, D. S. Mehta. J. S. Vaishya and K. C. Joshi, J. Mod. Opt. 42 (1995). 447-454. 



























S(P, (D) 



Fig. 4.6 The spectrum produced on superposing, at a point P two beams from circular 
sources, each of angular radius a = 3 X 10 -8 . The angular separation of the two sources is 
A = 3 X 10' 7 , the path difference (R 2 — Ri) = 10 pm and the baseline d = 5 m. [Adapted 
from D. F. V. James, H. C. Kandpai and E. Wolf, Astrophys. J. 445 (1995). 406-410.] 



Frequency. <n( 10 15 s ' ) 


Fig. 4.7 Results of the first laboratory experiments testing the theoretical predictions 
shown in the previous figure, with the same values of the parameters. Dashed line: the¬ 
oretical prediction; solid line: results of experiments. [Adapted from H. C. Kandpai, 
K. Saxena, D. S. Mehta, J. S. Vaishya and K. C. Joshi, J. Mod. Opt. 42 (1995), 447^154.] 
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4.4 Interference of narrow-band light 


It is frequently assumed that, when light of sufficiently narrow bandwidth, centered at fre¬ 
quency w ( ), say, interferes, sharp fringes (i.e. fringes essentially of unit visibility) are 
formed, imitating interference of monochromatic light of that frequency. We will now show 
that this assumption is incorrect; and also that interference fringes formed with narrow-band 
light exhibit a number of interesting features. 

Suppose that we place identical filters in front of each of the pinholes at points Q A and 
Qi in Young's experiment. We will consider how the passband of the filters affects the 
interference pattern. 


Let 


W®(0 L , 02, ( 0 ) = (t/ (i) *(0„ w)[/ (i \0i, a/)} 


( 1 ) 


be the cross-spectral density of the light incident on the pinholes. The expectation value 
on the right of Eq. (1) is to be understood in the sense of coherence theory in the space- 
frequency domain, which we discussed in Section 4.1. For the sake of simplicity, we have 
now omitted the subscript oj on the angular brackets. 

Let T(w) be the transmission function of each filter. The cross-spectral density function 
of the filtered light which emerges from the filters is then given by the expression 


W^ + HQ V Q v oj) = (T*(oj)U^(Q v oj)T(oj)U (i fQ r oj)} 
= \T(wfW {i HQ v 0 2 ,o/), 


( 2 ) 


where we have made use of Eq. (1). 

The spectral degree of coherence of the light transmitted by the filters, immediately 
behind them, is given by the expression 



W^(Q v Q y oj) 


(3) 



On substituting from Eq. (2) into Eq. (3) we find at once that 


t^iQuQi, w) = ^\Qj.Q 2 .co). 


(4) 


where 



W"HQ v Q,.u) 


(5) 



is the spectral degree of coherence of the light incident on the pinholes. Formula (4) shows 
that the spectral degree of coherence is unchanged by linear filtering. 
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Next let us consider the effect of the filters on the “space-time"’ degree of coherence 
7 (Q h Qi, t) which we discussed in Section 3.1. For light incident on the two pinholes, it is 
given by the expression 


7 (1 HQ v Q 2 ,t) 


^ Hq v q 2 ,t) 

<Jr (i HQ v Q 1 ,T)jF i HQ 2 ,Q 2 ,T) ’ 


( 6 ) 


where L'S {Q ,. Q 2 , r) is the mutual coherence function, defined by Eq. (1) of Section 4.1. 
According to the inverse of Eq. (2) of Section 4.1. it is just the Fourier transform of the 
cross-spectral density function, i.e. 

r* n (G r e 2 .r) = f o W' i HQ r Q y oj)e-^ dm, ( 7 ) 


where the integration extends over the positive frequencies only, because of our use of the 
analytic signal representation. 

On using Eq. (2) it is clear that the mutual coherence function of the light emerging from 
the filters is 


r i+ HQ r Q„T)= r\T{wfW^{Q■ Q flOe-^dcw. (8) 

Suppose that the effective bandwidth Aw of the filters is so small that the absolute value 
and the phase of the cross-spectral density VV < 1 , ({2 1 . Q 2 . w) of the incident light are effec¬ 
tively constant over the bandwidth Aw of the incident light (see Fig. 4.8). If w () is the mean 
frequency of the incident light w e may then evidently approximate Eq. ( 8 ) by the formula 

r t+ l(G..G 2 .T)~ lF ( ' : (0 r 0..w n )fV(w)| 2 e H ^dr. (9) 



Fig. 4.8 Schematic illustration of the relative behavior of the modulus of the transmis¬ 
sion function T(co) of identical filters placed in front of the pinholes in Young’s interfer¬ 
ence experiment and of the modulus of the cross-spectral density W(Q t . Q 2 , <w) of the 
light incident on the pinholes, in experiments with narrow'-band light. The effective 
pass-band w f) — Aw/2 < w £ w 0 + Aw/2 of the filtered light is assumed to be so 
narrow/ that the modulus and the phase (not show>n in the figure) are substantially constant 
across it. [Adapted from E. Wolf, Opt. Lett. 8 (1983), 250-252.] 
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Hence the degree of coherence of the light emerging from the filters is given by the 
expression 


i +) (Q v Q v r) 


r {+ \q v q v t) 

^■ + >(Q v Q v 0)jp + HQ 2 ,Q 2 ,0) 

/t !0 (fip6 2 >w a )©(T), 


where the spectral degree of coherence is given by Eq. (3) and 


( 10 ) 


/ |r(a»)pe ,U>T doj 

9 ( 7 -) = ^- 

l M 2 do> 


( 11 ) 


We will refer to the function 0(r) as the filter function. We note that, as a consequence of 
a well-known inequality for integrals, we have 


Max|0(r)| = 0(0) = 1. 


( 12 ) 


It follows from Eq. (12) that 

Max|70 , (Q 1 ,2 2 ,r)| = j/u (i HQ r Q 2 ,(o 0 ) |- (13) 

T 

In words: the maximum of the absolute value of the (temporal) degree of coherence of the 
light emerging from the filters behind the two pinholes is equal to the absolute value of the 
spectral degree of coherence at the central frequency m 0 of the light incident on the pinholes. 

Suppose that the averaged intensities of the light that is incident on the two pinholes are 
the same. Then, according to Eq. (19) of Section 3.1, the modulus of the degree of coher¬ 
ence 7 ! ' +) ( 2 i, Q 2 , t) is equal to the visibility of fringes formed by the light emerging from 
the two pinholes. Equation (13), therefore, implies that the maximum visibility of the 
fringes formed by the filtered light is equal to the modulus of the spectral degree of coher¬ 
ence pS'XQi, Qi. «o) °f the (unfiltered) light that is incident on the pinholes. Thus we see 
that reducing the bandwidth of the incident light by linear filtering will not produce sharper 
fringes. In particular the fringe visibility will not approach the value unity, irrespective 
of how narrow the passbands of the filters are (unless, of course, |/u w (<2i, Q 2 , w 0 >] = 1). 
However, it is clear from Eq. (10) and from a well-known reciprocity inequality relating to 
effective widths of a pair of functions that are Fourier transforms of each other [B&W. 
Eq. (32) of Section 10.8] that the following result holds: the narrower the passbands of the 
filters, the broader the absolute value of the filter function 0(r). Consequently, according 
to Eq. (10) and the relation (19) of Section 3.1 between |7 <+) (<2 i, Qs. r)| and the visibility, 
more fringes will then be formed. 
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Fig. 4.9 Results of measurements of the fringe visibility ‘1 \t) in a two-pinhole experi¬ 
ment against log l0 (Atn), where Aw denotes the bandwidth of the filtered light. The two 
sets of measurements pertain to sets of filters centered on the wavelengths A = 633 nm 
and A = 488 nm. The points on each sequence are joined by dotted lines for clarity. The 
measurements confirm that in the frequency ranges considered the fringe visibility is inde¬ 
pendent of the bandwidth of the light over very wide wavelength ranges. [Adapted front 
L. Basano. P. Ottonello, G. Rottingni and M. Vicari, Appl. Opt. 42 (2003), 6239-6244.] 


We may summarize the main result that we derived in this section by saying that we have 
shown that linear filtering does not increase the spatial coherence of light (i.e. no sharper 
fringes are formed) but it increases its temporal coherence (i.e. more fringes will become 
visible). These theoretical predictions were verified experimentally. 1 Some of the experi¬ 
mental results are reproduced in Fig. 4.9. 


PROBLEMS 


4.1 It was shown in Section 4.1 that the cross-spectral density function of a statistically 
stationary field occupying a finite domain D may be expressed in the form 

Mr,, r 2 , (o) = (C/*(r,. a»U{r 2 , m)), 

where 


U(r. a>) = Ea„(w)o„(r. co). 

The functions <^„(r, w) are the eigenfunctions of an integral equation whose kernel is 
the cross-spectral density function Mr,, r 2 , w). The u n (w) are random coefficients 
which satisfy the requirement that (a*a n ) = A „6 nm , 6„„, being the Kronecker symbol. 


L. Basano, P. Ottonello, G. Rottigni and M. Vicari. Appl. Opt. 42 (2003). 6239-6244. 
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Show that 

(1) (U(r h w)U(r 2 , co)) = 0; and 

(2) if ifi r) and are the real and the imaginary parts of U then 

(t/ (r -(r 1 , w)J/ r) (r 2 , <o)) = (t/’Vrj, «)} 

and 

(lA r \r h w)U^\r 2 ,co)) = «)t/ rt (r 2 , «)>. 

Find also expressions for the real and for the imaginary parts of the cross-spectral den¬ 
sity W in terms of correlation functions involving the real and the imaginary parts of U. 

4.2 (a) Derive a relation between the space-time degree of coherence yj 2 (r) and the spec¬ 

tral degree of coherence /r 12 («) of the field at two points P t and P 2 . 

(b) Suppose that the light is quasi-monochromatic and that the normalized spectra at 
the two points are equal to each other, i.e. that 

S}{(o) = s 2 (ojY, 

and also that p u is independent of frequency over the narrow bandwidth Aa> of 
the light. How does the relationship between 7 i 2 (t) and p i2 (co) simplify in the 
case when r <* 2-kIAwI 

4.3 A cross-spectral density of a planar, secondary source has the factorized form 

Mr), r 2 , oj) = F[(ri + r 2 )/2, w]G(r 2 - ty, to). 

Show that, in order that F(r, w) represents the spectral density and G(r', w) the spec¬ 
tral degree of coherence of light across the source region, the spectral density has to 
satisfy a certain functional equation. Show that the above formula applies with any 
spectral density distribution whose spatial dependence has the form 

S l0 >(p,«) = S (0) (.r, y, a>) = S l0, (0.0. w)e (J i j: ^- v) , 
where f3 j and /3 2 are constants. 

4.4 Consider two identical small sources, separated by a distance d. The spectrum of each 
source is Sq(co) and the correlation between them is characterized by //q(w). 

Derive an expression for the total power radiated by the two sources and discuss 
the limiting cases d < A and d > A (A = 2irc/w). Comment on the implications of the 
result for the case A > d on the overall behavior of the spectrum of the far field. 

4.5 Statistically stationary partially coherent light propagates from the input plane Z = Zo 
through a deterministic, time-invariant system, which is rotationally symmetric about the 
2 -axis, to the output plane 2 = 2 ). The system is characterized by an impulse response 
function K{p, />', w). The vectors p and p' are two-dimensional position vectors perpen¬ 
dicular to the 2 -axis, of points in the input plane and in the output plane, respectively. 
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4.6 


(a) If Wi, (p[, pi, co) is the cross-spectral density of the light in the input plane, derive 
an expression for the spectral density .S'jtp. oj) of the light at a typical point in the 
output plane. 

(b) Consider the special case of (a), when the input is a polychromatic plane wave 
which propagates in the positive-^ direction. 

Qt(w) = a(o))X(oj) + F(w), Q 2 (co) = /3(co)X(w) + F(u>) 

are sample functions, in the frequency domain, that represent the fluctuations of two 
small sources, located at points P\ and P 2 . X(oj) and F(w) are random mutually uncor¬ 
related functions, i.e. 

(X*MF(w)) = 0. 

and <■>(&>) and 0(oj) are deterministic functions such that 

|a(m)| = |j3(w)|. 

(a) Derive an expression for the degree of correlation /i 12 (w) of the two sources in 
terms of the spectra S x (oo) and Syico) of X(co) and F(w), respectively. 

(b) Derive an expression for the spectrum .S', < P, co) of field produced at a point P equi¬ 
distant from the two sources and sufficiently far from them when 

S x (oj) = S Y (co) = A 2 e^“ _a> o)‘ /l2o “) (A and u are positive constants) 

and 

|q(w)| = 1, o Q (a>) — = -cot, (t> l/(j), 


where <p a and 0 i3 are the phases of a and 3 respectively. Sketch S a (P, co) as a func¬ 
tion of cu. 
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Radiation from sources of different 
states of coherence 


5.1 Fields generated by sources with different coherence properties 

Light generated by sources of different states of coherence may exhibit very different 
behavior. This is quite evident from some simple examples, as we will now show. 

Suppose first that the light originates in a thermal source, e.g. it is emitted by a hot body. The 
radiant intensity, JJ s), which represents the rate at which power at frequency w is radiated by 
the source per unit solid angle around a direction specified by a unit vector s (which makes an 
angle 6 with the normal n to the source plane), is given by Lambert’s law [Fig. 5.1(a)] 

JJO) = JJO)cosO. (1) 

This is a rather broad angular distribution, illustrated on a polar diagram in Fig. 5.1(a). On 
the other hand, light generated by a single-mode laser will be very directional [Fig. 5.1(b)]. 
Practically all the laser light will be concentrated within a very narrow solid angle around 
the forward direction and, consequently, the polar diagram of the radiant intensity J w (s) 
will now have a needle-like form. 

Apart from this obvious difference between the radiation originating in these two kinds 
of sources there is a more subtle difference. It becomes evident if one considers the depend¬ 
ence of the radiant intensity on the shape of the source. It is clear from Eq. (1) that the radi¬ 
ant intensity of the light generated by a Lambertian source is independent of the shape of 
the source, being just proportional to cos (9; it is, therefore, rotationally symmetric about 
the normal 8 = 0 to the source plane. On the other hand, in view of the well-known 
Fourier-transform relationship between the light distribution in the far zone and the light 
distribution across the source plane of a spatially coherent source (usually the aperture 
plane), the radiant intensity generated by such a source depends strongly on the shape of 
the source. Consider, for example, a uniform circular source [Fig. 5.2(a)]. It will generate 


Examples of modern Lambertian sources, i.e. sources which obey Lambert’s law. are light-emitting diodes. See, 
for example. E. F. Schumba, Light-Emitting Diodes, second edition (Cambridge University Press, Cambridge, 
2006), Section 5.5. 
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m 



source polar diagram of radiant intensity J(0) 

Fig. 5.1 Comparison of the angular distributions of the radiant intensity J(9) of light 
produced by a thermal source (hot body) (a) and a single-mode laser (b). 





far-field intensity pattern 


Fig. 5.2 Illustrating the effect of change of shape of a spatially coherent planar source 
on the far-zone intensity pattern. 


a rotationally symmetric intensity distribution, with circular contours [Fig. 5.2(b)]. 
Suppose that the source is “stretched” in the y direction [Fig. 5.2(c)]. Then the far-zone 
intensity distribution will shrink in that direction [Fig. 5.2(d)]. The difference between the 
two very different kinds of sources is evidently due to the difference in their coherence 
properties, the Lambertian source being spatially highly incoherent (see Section 5.5), 
whereas the laser is spatially highly coherent. 

There are, of course, other important differences between beams of light generated by these 
two kinds of sources. As we have seen earlier (Section 2.1), the probability distributions which 
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Planar secondary 
source a 


Fig. 5.3 Illustrating the notation relating to a planar, secondary source a. 


govern the fluctuations of the light field in the two cases are quite different. Another difference 
is in the average number of photons in a cell of phase space (the coherence volume). The num¬ 
ber is called the degeneracy parameter of the light and is discussed in Appendix I. 

In this chapter we will study radiation properties and coherence properties of fields gen¬ 
erated by sources of different states of spatial coherence. We will mainly deal with fields 
generated by planar, secondary' sources, which are of particular interest in many applica¬ 
tions, e.g. in instrumental optics. The corresponding results pertaining to three-dimensional 
primary sources are very similar (M&W, Section 5.2). 

5.2 Correlations and the spectral density in the far field 

Let us consider a planar, secondary source a of finite size, assumed to be statistically sta¬ 
tionary. at least in the wide sense. Such a source may be an opening, for example, in an 
opaque screen illuminated either directly or via an optical system. 

According to the coherence theory in the space-frequency domain that we studied in the 
preceding chapter, the cross-spectral density function of the field at a pair of points 5, and 
S 2 in the source plane (see Fig. 5.3) may be expressed in the form [Eq. (13) of Section 4.1. 
with the subscript w omitted from now on] 

W (0) ( P ;, p), to) = {uW (pi, to) U m (p). to)). (1) 

Here p] and p' 2 are two-dimensional position vectors specifying the locations of the two 
points, with respect to an origin O in the source plane z = 0, and the superscript zero indi¬ 
cates that the quantities pertain to points in that plane. If- 0 ’ (p\ to) represents, of course, a 
member of a statistical ensemble of the frequency-dependent realizations and the angular 
brackets denote the average over that ensemble. 

Let us now consider the cross-spectral density function VF(r|, r 2 , to) of the field at any 
pair of points .Pjfr,) and / > 2 (r 2 ) in the half-space z > 0 into which the source radiates (see 
Fig. 5.4). It may be expressed by the same formula as W <0 \ viz.. 
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Wi r . r 2 , co) = {if (r,, oo)U( r 2 , co)). 


( 2 ) 


where {(/(r, co)} represents the ensemble of the field at the point r generated by the field in 
the source plane z = 0, U( r, w) may be expressed in terms of the “boundary field" ( 7 i0) (p', co) 
by the first Rayleigh diffraction integral [M&W, Eq. (3.2-78)] 1 


U(r co) =-— f U (0, ( p, co) 

2tt J c r 


d 

t' kR 

dz 

R 


d y. 


(3) 


where 


R = r - p'|. (4) 

We will consider the far field, i.e. the field at points P which are at very large distances 
from the origin, which is taken to be in the source region. To evaluate U at such points, it is 
convenient to set r = rs (s 2 = 1). Evidently, for sufficiently large distances r, 

R ~ r — s • p . (5) 

where s • p'denotes the projection 0/Vof the distance OQ onto the s direction (see Fig. 5.5) 
and hence 


1 When the source is a primary three-dimensional source, rather than a two-dimensional secondary one, with 
source distribution Q( r, to) occupying a finite domain D, one has in place of Eq. (3) the formula 

/ « pit/? 

Q(r\ co) -dV, 

d R 

where R = |r — r'|. For proof of this result see, for example, C. H. Papas, Theory of Electromagnetic Wave 
Propagation (McGraw-Hill, New York, 1965), Section 2.1. 
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secondary 
source a 


Fig. 5.5 Illustrating the approximation (5). valid for large distance r. 


Consequently 


s ikR 

R 


e itr 


Q—iks-p’ 


r 


d e liR 


ik 




( 6 ) 


(7) 


On substituting from Eq. (7) into the Rayleigh diffraction integral (3) and noting that 
z/r = cos 6 we obtain the following expression for the value of U in the far zone, which we 
will denote by U^ m h 

f* i 

U {=c) (rs,w) ~ -—cos 6 - I f/ (0, (p . w)e _iis p d 2 p'. (8) 

27T r J cr 

Here 9 denotes the angle which the s direction, pointing from the origin in the source plane 
to the observation point P = rs in the far zone, makes w ith the "-axis (i.e. with the norma] 
to the source plane). 

It will be convenient to introduce the two-dimensional Fourier transform {/ (0) (f, oj) of 
[/ 0) (p\ <w), viz.. 


t/ ( 0 Hf.w) = -— f U l0 Hp d 2 p\ (9) 

(27r) 2 J 

(z—0) 

where f is a two-dimensional spatial-frequency vector. Although formally the integral on the 
right-hand side of Eq. (9) extends over the whole plane z = 0 containing the secondary 
source, it is actually taken only over the source region pr, because C 0) (p'. w) = 0 outside a. 
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With the definition of (f 0 ' 1 given by Eq. (9), the formula (8) may be rewritten in the more 
compact form 

gi kr 

U (M )(rs,co) - 2irik cos9U {0) (ks , co) -, (10) 

r 

where s x is the projection, considered as a two-dimensional vector, of the three-dimensional 
unit vector s onto the source plane z = 0; i.e. if we write s = (s T , .v,„ sj then s x = (s x , s y , 0). 

According to Eq. (2 ) the cross-spectral density at a pair of points in the far zone speci¬ 
fied by position vectors r, = /qs, and r 2 = r 2 s 2 (sf = ss = l ) is given by the formula 

W <m (r,s b r 2 %, w) = {C/ (30) * (rjS,, w)£/ ,5C) (r 2 .s^), to)). (11) 

On substituting from Eq. (10) into this formula we see at once that 


Wl“HrjS r r 2 s 2 , W ) 


e i/c(r,-rj) 

(Ink) 2 cosft cos0,{L f(O »*(/ts, .w)f/ (0 HA-s, ,,w))---. (12) 

r \ r 2 


where, of course, 9 , and 0 2 are the angles which the unit sectors Sj and s 2 make with the posi¬ 
tive z-axis. On the right of the formula (12) there is the av erage of the product of the two-dimen¬ 
sional Fourier components of if®. We will now show that this average may be expressed as a 
four-dimensional Fourier transform of W' h . We have, on usmg the definition (9) of U {0 \ that 


(U m \f v w)U^Ht 2 ,to)) 


a tt) 


l — JJ (U ,0 '*(pl.oj)U«>'(p:.to) e“ ilf :A- f r p ’d 2 p| d 2 ^. (13) 


(z=0) 


Let us introduce the four-dimensional Fourier transform of VV' l0) , viz.. 

1 


ff W< 0l (p;.p:. w)e" i( VPi' + Lp))d 2 p ; d 2 ps. (14) 

(27r) 4 JJ 


(z=0) 


Now according to Eq. (1) the expectation value under the integral sign in Eq. (13) is just the 
cross-spectral density W^YpJ, pj, oj) of the field in the source plane and hence the right-hand 
sides of (13) and (14) are equal to each other, apart from the difference in the sign of ^ in the 
exponent. Taking this difference into account, the left-hand sides of these equations will also 
be equal to each other and hence 


{{/ (0 >*(f p (u)£/< 0 >(f 2 , w )) = W<o>(-fj,f 2 oj). (15) 

Finally, on substituting from Eq. (15) into Eq. (12) we obtain the following expression 
for the cross-spectral density of the far field: 


gi Hr 2 -r,) 

IW^qSp/gs^.w) = (27rk) 2 W ( °l(— ks, ,ks 2± ,(o) -cos Q i cos(16) 

r \ r 2 
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This is a basic formula from which various properties of the far field generated by planar, 
statistically stationary sources of any state of coherence may be deduced. We will study 
them later in this chapter. 

The formula (16) is an analogue in second-order coherence theory' of the Fraunhofer for¬ 
mula of the elementary theory' of diffraction of monochromatic light. It implies that, apart 
from simple geometrical factors, the correlation of the radiated field at a pair of points in 
the far zone, in directions specified by unit vectors Sj and s 2 , is given by a particular four¬ 
dimensional spatial-frequency component W 01 (f,. f 2 , co) of the cross-spectral density of 
the light in the source plane, namely one for which the two-dimensional spatial-frequency 
vectors are f, = — ks l± and f 2 = ks 2 ±. Since s l± and s 2 are components of unit vectors it 
can be seen at once from Eq. (16) that only those spatial frequency vectors for which 
(fj j < k and |f 2 | ^ k of W'' 0) contribute to the cross-spectral density function U /,X| of the far 
field. We will call them low-spatial-frequency components. 

The spatial-frequency component for which the opposite inequalities hold. i.e. for which 
jfi | > k and |f 2 > k may be called high-spatial-frequency components. There are, of course, 
also “mixed" situations when jf, j > k and |f 2 ] £ k or |fj| k and Jf 2 | > k. The basic formula 
(16) shows that only the low-spatial-frequency components of the source contribute to the 
far field. The high-spatial-frequency components give rise to evanescent waves, (see. for 
example, M&W, Section 3.2), which decay exponentially in amplitude with increasing dis¬ 
tance from the origin and, consequently, do not contribute to the far field. 

As already mentioned, one may derive from formula (16) various properties of the far 
field. In particular let us consider the spectral density S (cc) (rs, w), which is also called the 
optical intensity at frequency to, of the field at the point /’(rs) in the far zone. It is given by 
the expression 


S^\rs, oj) = (t7< x) *(rs, oflf^irs. w)) = W <x) (rs, rs, w) (17) 

or, using the formula (16), 


S (x) (>s,w) 


2irk 

r 


VT <0) ( — A'S , ks ,a>)cos 2 6. 


(18) 


We note the inverse-square-law dependence of S {x> on the distance r from the source, a 
result reminiscent of the inverse square-law of elementary wave theory. It is to be noted that 
the angular dependence of the spectral density is not only given by the proportionality fac¬ 
tor cos 2 !) but also depends on the spatial coherence properties of the source, through the 
dependence of W 0) on the directional vector s. 

Because the distance from the source enters (18) only through the factor \lf it is con¬ 
venient to set 


S^Hrs.w) 


JJ& 


(19) 







86 


Sources of different states of coherence 


Element dfl of 



source a 


s 


Fig. 5.6 Illustrating the meaning of the radiant intensity It represents the rate at 
which energy at frequency w is radiated into the far zone, per unit solid angle dfl around 
the direction specified by a unit vector s. 


where, evidently, 


JJs) = (2-k) 2 W i0> (-ks ,ks .w)cos 2 6. 


( 20 ) 


The function Jj s I is know n as the radiant intensity at frequency co or, more precisely, as 
the spectral radiant intensity and is a generalization of a quantity bearing the same name in 
traditional radiometry which deals with light from spatially incoherent sources. In suitable 
units the radiant intensity is a measure of the pow er radiated by the source per unit solid 
angle, around the direction specified by the unit vector s. per unit frequency interval cen¬ 
tered at the frequency w (see Fig. 5.6). 

Another quantity of interest relating to the far field is its spectral degree of coherence 
^(nSi, r 2 s 2 ’ 0J >- According to the general formula (6a) of Section 4.2 it is given by the 
expression 





( 21 ) 



Apart from a simple phase factor, the right-hand side of this formula may be expressed in 
terms of the cross-spectral density function of the source by the use of formula (16). One 
then finds that 



. ks 21 ,w) 



(22) 
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Fig. 5.7 Illustrating longitudinal spectral coherence (a) and transverse spectral 
coherence (b) in the far zone. 


We note that depends on the distances /', and r 2 from the origin of the two points 
P\{r\ Sj) and P 2 (r 2 s 2 ) in the far zone only through the phase factor k(r 2 — r x ). 

Two consequences of Eq. (22) are of special interest. When the points P\ and P 2 in the 
far zone are located in the same direction, i.e. when s 2 = s h the first factor on the right- 
hand side of Eq. (22) has the value unity and one then has 

p^fr^s.ps.w) = (23) 


Consequently 


|(r ( * ) (r | s,7- 2 s.co)| = 1. (24) 

This formula implies that along any direction pointing from the source, the far field is spa¬ 
tially completely coherent at each frequency «, a result which may be expressed by saying 
that the far field has complete longitudinal spectral coherence at each frequency, irrespec¬ 
tive of the state of coherence of the source [Fig. 5.7(a)]. 

Let us next consider the situation in which the two points in the far zone are located at 
the same distance, r say, from an origin in the source region, in directions s, and s,. The 
spectra] degree of coherence //"’(rSj, rs 2 , <u) may then be said to represent transverse 
coherence. It follows at once from the general formula (21) that the transverse degree of 
coherence of the light in the far zone is independent of the distance r and depends only on 
the directions Sj and s 2 [Fig. 5.7(b)], 
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5.3 Radiation from some model sources 


To gain some insight about properties of fields radiated by sources of different states of 
coherence it is convenient and useful to consider fields that are produced by certain types 
of model sources. Some of these sources represent, at least approximately, sources that are 
frequently encountered in nature or that can readily be produced in a laboratory. Moreover, 
the fields which they generate may often be analy zed by the use of relatively simple math¬ 
ematics. A broad class of such model sources is formed by the so-called Schell-model 
sources , which we will consider first. 


5.3.1 Schell-model sources 


A planar secondary' source of this class is characterized by the property that its spectral 
degree of coherence p ( 0 ) (Pi. P2, tu) depends on the location of the two points P \ and P 2 only 
through the difference p ; — pi, of their position vectors p, and p 2 We will then denote the 
spectral degree of coherence of the light in the source plane by p l0l (pj — pj. w) rather than 
by p (0) (pi, p 2 . oj). Recalling the definition of the spectral degree of coherence [Eq. (6b) of 
Section 4.2], it follows that the cross-spectral density function of a planar, secondary 
Schell-model source has the form 


W 0, (pj.P;.w) = y[s w (pj,(Pj,a>)p >(p; - pj.w). 


( 1 ) 


An expression for the radiant intensity of the field generated by a source of this kind may 
readily be calculated on first taking the four-dimensional spatial Fourier transform of 
expression (1) and substituting it into the general formula [Eq. (20) of Section 5.2]. On 
changing the variables of integration in the Fourier transform by setting pj + p ' 2 = 2p and 
p 2 — p[ = p' one obtains the following expression for the radiant intensity generated by 
such a source: 



where 


H<°H p'.w) = j V5 (0, (P + p’/2,a>)JsW(p ~ p'/2,w) d 2 p. (3) 


As an example let us suppose that the source is circular and of radius a and that both the 
spatial distribution of its spectral density across it and also its spectral degree of coherence 
are Gaussian, i.e. that 



when p < o 
when p > a 


(4) 
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and 






(5) 


where the quantities A, a s and are taken to be independent of position but. in general, 
depend on the frequency. Usually a S> cr s . We will assume this to be the case, which simplifies 
subsequent calculations. Such sources are known as Gaussian Schell-model sources and the 
fields which they generate are called Gaussian Schell-model fields They have been studied 
extensively in the analysis of various partially coherent fields and in connection with propaga¬ 
tion of beams through the turbulent atmosphere where fields of this kind, which have beam-like 
form, are called Gaussian Schell-model beams. We will encounter such beams in Chapter 8. 

On substituting from Eqs. (3)-(5) into Eq. (2) one finds after a long but straightforward 
calculation that 


H<°'(p\ w) = 27rAV 2 eV M8 ^>, (6) 

and 

JJ9) = JJ0)cos 2 ec-i mhin2e , (7) 


where we have now written JJ9) rather than s), 9 denoting, as before, the angle which 
the unit vector s makes with the positive z-axis; and we assumed not only that a > cr s but 
also that a > o lr In Eq. (7) 

JM = (MoyS) 2 (8) 

and 


6 2 


_J_ 

< 2^) 2 


4- 



(8a) 


We see from Eq. (5) that /r (0) (p', w) —> 1 as cr ;( —»°o. The source is then fully spatially 
coherent, but this limit has to be interpreted with caution. For our earlier assumption that 
a > cr„ demands that the source diameters must also become infinite, in such a manner that 
the ratio a/er„ tends to a finite limit which is large relative to unity. Now according to Eq. 
(4) the spatial intensity distribution across the source has a Gaussian form. This is clearly 
the same situation as one encounters when the source is a laser operating in its lowest-order 
Hermite-Gaussian mode. In this case (tr„ —> do) Eq. (8a) implies that 


<5 = 2rr s . (9) 

Moreover, we now have kS = 2ir(8/\) = 4 tto s /X and. for a realistic laser, this parameter 
will be much greater than unity. The exponential term in Eq. (7) will then have a non- 
negligible value only when sin 9 < \. The expression (7) for the radiant intensity JJ8) 
generated by such a source may then be approximated by the expression 
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JJ.9) = JjO),-i (kS)2e2 . (10) 

Evidently the angular spread of the radiation from such a source is confined to a domain of 
semi-angle 9 ~ l/(k6) = X/(2n6) ~ AKAtut^, in view of Eq. (9). In this case the radiation 
is effectively confined to a very narrow angular domain, i.e. under these conditions the 
source generates a beam - namely a Gaussian beam, with a beam waist w 0 = 6 = 2u s . Thus 
we have shown that Gaussian Schell-model sources are generalizations of laser sources 
operating in their lowest-order Hermite-Gaussian mode. 

Let us now consider the other extreme case, namely when cr —> 0. This obviously rep¬ 
resents the incoherent limit (zero spectral correlation length). In this case we have from 
Eq. (8a) that 6 —■* 0. In order for the factor J m ( 0) given by the formula (8) to remain finite 
and non-zero, with o s fixed, it is necessary that A cc in such a way that the product AS 
remains finite. Expression (7) then reduces to 

U0) = JJ0)cos 2 0. (11) 

Hence in the incoherent limit the radiant intensity falls off with 9 as cos 2 #. Since, for a 
Lambertian source, = JJ. 0)cos#. this result implies that a spatially strictly incoherent 
planar, secondary source cannot be Lambertian, at least when the incoherent source is mod¬ 
eled as a limiting case of a Gaussian Schell-model source. One might, therefore, suspect that 
a Lambertian source is not completely spatially incoherent. We will later see (Section 5.5) 
that this is indeed the case. 


5.3.2 Quasi-homogeneous sources 

An important sub-class of Schell-model sources (again assumed to be planar, secondary 
sources) is constituted by so-called quasi-homogeneous sources. For such sources the spec¬ 
tral density .S’ llll (p. w) changes much more slowly with p than the spectral degree of coher¬ 
ence /i (0) (p', to) changes with p' = p' 2 — p,'. One says that S'^lp, to) is a “slow” function of 
p whereas fiP\ p', w) is a “fast" function of p Such behavior is illustrated in Fig. 5.8. 
Further, the linear dimensions of sources of this kind are assumed to be large relative to the 
wavelength A = 2txcI<j) of the light. 

For a quasi-homogeneous source the expression (1) for the cross-spectral density simpli¬ 
fies. It may evidently be approximated by the formula 


W (0 Hp 1 '.p 2 , ,to) ss s (0) 


Pi' + fb 


. to 


^ (0) (p( - p,',to). 


( 12 ) 


The fact that W (0) now has such a factorized form leads to considerable simplification in 
further analysis. 

To derive an expression for the radiant intensity and for the spectral degree of coherence 
of the far field generated by a planar, quasi-homogeneous source we must first determine the 
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Wp, w) 

i 



5 (Pi + Pi) 


spectral density (intensity) 
(slow function) 


>-P 


|p (0, (p'. ")l 



spectral degree of coherence 
(fast function) 



area of coherence 


source 


Fig. 5.8 Illustrating the concept of a quasi-homogeneous source. The effective spectral 
coherence area of the source is much smaller than the region of the source over which 
the spectral density (intensity) changes appreciably. 


four-dimensional spatial Fourier transform of the expression (12). For this purpose we intro¬ 
duce the same new variables as we did before in the transition from Eq. (1) to Eq. (2). viz., 

P = |(Pt' + P: b>. p' = P:' - Pi'- (13a) 

We have the inverse relations 

Pi' = P ~^rp', p( = p + tp' (13b) 

and one readily finds that the four-dimensional Fourier transform IF 01 , defined by Eq. (14) 
of Section 5.2, of the cross-spectral density function (12) has the factorized form 

w) = S ( 0 »(f, + f,,«)q (0 > l(f 2 - f,),m 


(14) 
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where 5 (n ' and /T°' are the two-dimensional Fourier transforms of S ,0) and of p m K respectively: 


5 (0 >(f,£u) - ' , 

(2?r)“ « 

j S t0) ( p,oi)e lfp d 2 p, 

J (z=0j 

(15a) 


f /z (0) (p . «)e“ lf p d 2 p'. 

(15b) 

(z=0) 



The formula (14) shows that for a quasi-homogeneous source the four-dimensional Fourier 
transform of the cross-spectral density of the source distribution factorizes into the product 
of two two-dimensional Fourier transforms, one involving the spectral density and the 
other involving the spectral degree of coherence of the source. Using Eq. (14), the radiant 
intensity and the spectral degree of coherence of the far field [Eqs. (20) and (22) of Section 
5.2] may readily be evaluated. One finds that the radiant intensity of the field generated by 
a quasi-homogeneous source is given by the expression 

JJ6) = (27rfc) 2 S (0, (0. w)/Z (0) (ks L , o))cos 2 (9. (16) 

where the factor 

S«b(0,w) = —f S i0> (p'. <a)& 2 p (17) 

(27T,) 2 J (j=0) 

is proportional to the integral of the spectral density of the source, taken formally over the 
entire source plane. 

Equation (16) shows that the angular distribution of the radiant intensity depends on the 
product of cos 2 # and of the two-dimensional spatial Fourier transform of the spectral 
degree of coherence of the light distribution across the source. The spectral intensity distri¬ 
bution across the source, i.e. the spectral density .S’" ll (p. a>), enters only through the factor 
S (0) (0, oj) given by Eq. (17). Hence the angular distribution of radiant intensity generated 
by a quasi-homogeneous source is independent of the shape of the source ; it is essentially 
determined by the spatial-coherence properties of the source, represented by the spectral 
degree of coherence p (0 \ This result explains a fact we noted earlier, in Section 5.1, namely 
that the angular distribution of the intensity throughout the far zone generated by 
Lambertian sources (which, as we will learn in Section 5.5, are quasi-homogeneous) is 
independent of the shape of the source. 

The far-zone coherence properties of the radiation produced by a planar, secondary 
quasi-homogeneous source may also be readily determined. We see from Eq. (14) that 

W (0, (—£s u , ks 2± , w) = S (0, [A-(s 2 -s l± ),(i>]p i0) yk(s lL + s 21 ),<o . (18) 

and there are two similar expressions for the two terms appearing in the denominator of 
Eq. (22) of Section 5.2 for the spectral degree of coherence. On substituting from Eq. (18) 
into that equation we find that 
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M 


(x) (r,s 


ri’ 


S (0) [k(s,, ± - s, ),w] - „ iWr . 

,,w) = -=r^-—-G t0) (fe,, ,ta, | 


S (0) (0,m) 


n> 


(19) 


where 


G (0) (fe 1± ,fe, ± ,ftj) = 


A (0) [iA'(s lx + s 2± ),w 
^Jp m (ks ]± ,w)^p (0) iks 2X ,w) 


( 20 ) 


Now, for a quasi-homogenous source, the spectral density S l 0 | (p. co) is a “slow" function 
of p whereas p ( 0 ) (p', co) is a “fast” function of p'. According to a well-known reciprocity 
relation involving Fourier-transform pairs, 5 <0) (f. co) will, therefore, be a “fast” function of 
f and /7 ,0, (f\ co) will be a “slow” function of f. Using these facts, the expression on the 
right of Eq. (20) may be approximated by unity and the expression (19) for the spectral 
degree of coherence of the far field reduces to 


fi (x) (i]S r r 2 s 2 ,co) 


S (0 '[k(s 2± -s u ),ft)] 

^°>(0,w) 


gi k(r 2 —r t ) 


( 21 ) 


This formula shows that, apart from a simple geometrical phase factor, the spectral degree 
of coherence /i l “ ) (r 1 s ] , r 2 s 2 , co) of the far field generated by a planar, secondary quasi- 
homogeneous source is equal to the normalized spatial Fourier transform of the spectral 
density across the source. 

Except for the difference in notation, the formula (21) is of exactly the same form as the 
far-zone form of the van Cittert-Zemike theorem [Eq. (20) of Section 3.2] for the equal¬ 
time degree of coherence '/^(‘ r i s 1 j r 2 s 2 - 0) s /“W iSj, r 2 s- 2 j, which pertains to quasi- 
monochromatic radiation from a spatially incoherent source. Clearly the formula (21) may 
be regarded as its generalization, in the space-frequency domain, to the far field generated 
by quasi-homogeneous sources. It should be noted that such sources may have large coher¬ 
ence areas. 

We will now examine more closely the two main formulas which we derived in this sec¬ 
tion. namely Eqs. (16) and (21), taking for simplicity r 2 = r, in Eq. (21). W 7 e may express 
these two formulas in the form 

JJfi) = (2Trk) 2 Cp <0 \fs x , a>)cos 2 0. (16a) 


fj,^Hr l s l ,r 2 s 2 ,to) = ^S (0] [k( S| A - s 1± ),w], (21a) 

where the factor C = .S’ (0 YO. co) is given by Eq. (17). This pair of formulas brings into evidence 
two interesting reciprocity theorems relating to radiation from planar quasi-homogeneous 
sources. The first shows that the angular distribution of the radiant intensity : primarily 
depends on the two-dimensional spatial Fourier transform of the spectral degree of coher¬ 
ence of light across the source and, as already rioted, is therefore independent of the shape 
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of the source. The second shows that the spectral degree of coherence of the far field is pro¬ 
portional to the two-dimensional spatial Fourier transform of the spectral intensity distribu¬ 
tion (the spectral density) across the source. These two results are indicated schematically 
in the following diagram, in which the arrowed lines indicate Fourier-transform pairs: 


SOURCE PLANE FAR ZONE 


Spectral Density 
(Optical Intensity) 

. 

Radiant Intensity 




Spectra] Degree of 

4 . 

Spectral Degree of 

Coherence 


Coherence 


We will illustrate these results by a simple example. Consider radiation from a second¬ 
ary, quasi-homogeneous uniform circular source of radius a > A whose spatial distribution 
of the spectral intensity and also spectral degree of coherence are Gaussian, viz., 


S (0) (p,«) = 


A : Q-p 2 ri2fi) 

0 


when p < a. 
v. hen p > a- 


( 22 ) 


and 


lf 0) (p . w) = e (23) 

where the parameters cr s and <r (l depend on frequency, in general. The calculations simplify 
if we assume that the linear dimensions of the source are large relative to the r.m.s. width 
o s of the spectral density function .S' 10 - 1 , i.e. if a > n.. Because the source is assumed to be 
quasi-homogeneous, we also have a 8> ny. Under these circumstances one readily obtains 
from the two reciprocity relations (16a) and (21a) the following expressions for the radiant 
intensity and spectral degree of coherence of the far field generated by such a source: 


JJ9) = (kAo^c J s ) 2 cos^e 

(24) 

p l<x,) (r lSv r 2 s 2 ,m) = 

(25) 


where 


Ml 2 — 1*21 SjJ. (26) 

Figure 5.9 shows the polar diagram of the normalized radiant intensity JJ0)UJ{)). cal¬ 
culated from Eq. (24), for radiation from sources with various values of the r.m.s. width. 








5.4 Sources which generate identical distributions 


95 



Fig. 5.9 Polar diagrams, calculated from Eq. (24) of Section 5.3, of the normalized 
radiant intensity from a Gaussian-correlated quasi-homogeneous source, for various 
values of the r.m.s. width <r ;t of the spectral degree of coherence p. The length of the 
vector pointing from the origin to a typical point on a curve labeled by a particular value 
of the parameter ka^ represents the normalized radiant intensity in the direction of that 
vector. [Adapted front E. Wolf and W. H. Carter. Opt. Conunun. 13 (1975), 205-206.) 


tr #1 , of the spectral degree of coherence //, We see that for small values of far /t the radiation 
is spread over a wide solid angle but that as kn^ increases, i.e as the source becomes more 
spatially coherent, the radiation becomes more directional, eventually forming a beam. For 
comparison, the corresponding polar diagram for radiation from a Lambertian source is 
included. Figure 5.10 illustrates the behavior of the spectral degree of coherence of the 
light in the far zone, calculated from Eq. (25). It shows that with increasing effective source 
size ( ka s increasing) the angular region over which the absolute value of the degree of 
coherence is appreciable becomes narrower. 

5.4 Sources of different states of spatial coherence which generate 
identical distributions of the radiant intensity 

It is frequently asserted that, in order for a source to generate a highly directional field, i.e. 
a narrow beam of radiation, it must be spatially highly coherent, such as. for example, a 
well-stabilized laser is. This, however, is not true. We will see that under appropriate con¬ 
ditions partially coherent sources of different states of coherence may produce beams which 
are just as directional as laser beams. More specifically we will show that sources of differ¬ 
ent states of coherence may produce fields with the same far-zone intensity distribution, i.e. 
with the same distribution of the radiant intensity. 
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Fig. 5.10 The behavior of the absolute value of the spectral degree of coherence, given 
by Eq. (25) of Section 5.3, of the far field generated by a quasi-homogeneous, Gaussian, 
planar, secondary source. The variable u l2 is defined b\ Eq. (26) of Section 5.4. [Adapted 
from W.H. Carter and E. Wolf. J. Opt. Soc. Amer. 67 (1977), 785-796.] 


It follows from Eqs. (14) and (20) of Section 5.2 that the radiant intensity of a field gen¬ 
erated by a planar, secondary, statistically stationary source may be expressed in the form 


= 


2tt 


cos 2 # 


JJ* Vf'W(p ] , ,p 2 ', w )e- ,i 'V ^ v!) d^'d 2 ^'. 


( 1 ) 


Let us express the cross-spectral density W"" of the held in the source plane in terms of the 
spectral density and the spectral degree of coherence of the light in that plane [Eq. (6b) of 
Section 4.2], The formula (1) then becomes 


U») = 


2ir 


cos 2 6 j'J yj S {0) (p, 7 , (O) yj (p2, (p,', PS, cu)e lks ^ <p 2 &d 2 p[d 2 f£. 


( 2 ) 

Equation (2) shows that both the spectral intensity distribution 5^ 0> (p, to) and the spectral 
degree of coherence /i ,0) (p], pi, w) of the light in the source plane contribute to the radiant 
intensity. The possibility exists that two different sources, one with spectral distribution 
S^(p\ cd) and spectral degree of coherence //["(p j. p), a>) say, and the other with distribu¬ 
tions S$(p' t cd) and fityfp 1, pi. cd), may generate the same radiant intensity JJs). To put it 
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differently, it seems plausible that there might be a “trade-off’ between the spectral density 
and the spectral degree of coherence in such a way that the two sources would give the 
same values of the integrals in Eq. (2) and, consequently, generate the same radiant inten¬ 
sity. It turns out that this is indeed possible. A simple example is provided by the Gaussian 
Schell-model sources which we discussed in Section 5.3. It follows at once from Eqs. 
(6)—(8) of that section that two sources of this kind, for which the r.m.s. widths cr, of the 
spectra] density and the r.m.s. width <r ; , of the spectral degree of coherence are such that the 
quantity 8, defined by Eq. (8a) of Section 5.3, viz.. 

1 11 

77 = O + (3) 

6 ( 2 ^)- a* 

has the same value, will generate fields with the same relative angular distribution of the 
radiant intensity. Moreover, as is evident from Eqs. (7) and (8) of Section 5.3. if the factors 
A [see Eq. (4) of Section 5.3] are such that the expression 

JJO) = ( kAa s 6 ) 2 (4) 

also has the same values, then not only the relative but also the actual values of the radiant 
intensities will be the same. 

The fact that for two such “equivalent” sources the sum of the two terms on the right- 
hand side of Eq. (3) has to be the same indicates the trade-off between the contributions of 
the spectral density and of the spectral degree of coherence. Some computed curves show¬ 
ing such a trade-off are shown in Fig. 5.11. Experimental verifications of these theoretical 
predictions are illustrated in Fig. 5.13, obtained by use of the system shown in Fig. 5.12. 

In Fig. 5.14, the changes of beam radii for propagation from sources with the same ini¬ 
tial r.m.s. radius er s = 0.1 cm but with different spectral degrees, of coherence (a) and from 
sources with the same r.m.s. spectral degree of coherence but with different r.m.s. w idths cr s 
of the intensity distribution (bt are shown. Figure 5.14(c) illustrates the equivalence theo¬ 
rem. There is a trade-off between spatial coherence of the source and the spatial intensity 
distribution across the source, in accordance with Eq. (3), resulting in beams which have 
the same angular spread in the far zone. 


5.5 Coherence properties of Lambertian sources 

We will now return to one of the reciprocity relations which we derived in Section 5.3 and 
make use of it to elucidate the spatial coherence properties of Lambertian sources. 

According to the first reciprocity relation, given by Eq. (16) of Section 5.3. the radiant 
intensity of a field generated by a planar, secondary, quasi-homogeneous source is given by 
the expression 


JJs) = (27rA') 2 5 <0) (0, a))fi {0) (ks ,w)cos 2 9, 


(1) 
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Fig. 5.11 Illustrating the behavior of the spectral degree of coherence and the spectral inten¬ 
sity distribution across three partially coherent sources [(b). (c), (d)] which produces fields 
whose far-zone intensity distributions are the same as that generated by a coherent laser 
source [(a)]. The parameters characterizing the four sources are (a) er,, = oo, a s — 1 mm, 
A = 1 (arbitrary units), (b) er (1 = 5mm. = 1.09 mm, A = 0.84. (c) cr= 2.5 mm. 
a s = 1.67 mm, A = 0.36 and (d) er ; , = 2.1 mm. a s = 3.28 mm. A = 0.09. The normalized 
radiant intensity generated by all these sources is JJf))IJJ0) = cos^expf-^fa^sin 2 #], 
(cr s = 1 mm). [Adapted from E. Wolf and E. Collett. Opt. Conmiun. 25 (1978), 293-296.] 


where 5 , °(0. w) is defined by Eq. (17) of that section. For a Lambertian source, 

4(S) = 4 01 cos ^ 

where denotes the radiant intensity generated by the source in the direction 9 = 0. 


(2) 
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Fig. 5.12 A system that was used to test that sources with different spatial coherence 
properties can generate identical angular distributions of the radiant intensity. L t . Li. L 3 
and L 4 are lenses, F is an amplitude filter, G is a rotating ground glass plate. PH is a photo¬ 
detector. [Adapted front P, DeSantis. F. Gori, G. Guattari and C. Palma. Opt. Commun. 
29 (.1979), 256-260.] 



Fig. 5.13 (A) Intensity distributions across a coherent laser source (a) and across an 
“equivalent” partially coherent source (b). (B) The measured angular distribution of 
intensity, / (arbitrary but same units) in the far zone of fields generated by the two 
sources shown on the left. [Reproduced from P. DeSantis, F. Gori, G. Guattari and 
C. Palma, Opt. Comimui. 29 (1979), 256-260.] 


Suppose, as is usually the case, that the spectrum is the same at all source points, i.e. that 
5 , ®(p’, «) = S {0 \a>) say. The formula (17) of Section (5.3) then gives at once 


A 

(27t) 2 


S (0 H«), 


S (0) (0,ai) = 


(3) 
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for beams with the same initial r.m.s. beam radii (cr s = 0 . L cm), but with different degrees of 
coherence. The wavelength for each beam was taken as 6328 A. (b) The r.m.s. beam radii for 
beams with the same degree of coherence, with a = 0.2 cm, but with different initial r.m.s. 
radii a,. The wavelength for each beam was taken as 6328 A. (c) The r.m.s. beam radii for 
beams with different initial r.m.s. beam radii a s and w ith different widths a^ of the degree of 
coherence, but with equal far-held beam angles. # 3 . as predicted by the "‘equivalence theo¬ 
rem” discussed in Section 5.4. The parameters of the four beams are (i) a s — 0.1 cm and 
< 7 ^ = 00 , (ii) cr s = 0.109 cm and a t , = 0.5 cm. (iii) cr, = 0.167 cm and <r ; , = 0.25 cm and (iv) 
a, = 0.328 cm and tr„ = 0.2 1 cm. The wavelength of each beam was taken as 6328 A. 
[Adapted from J. T. Foley and M. S. Zubairy. Opt Common. 26 1 1978), 297-300.] 


where A denotes the area of the source. If we next substitute from Eq. (3) into Eq. (1), use 
Eq. (2) and solve for /7 (0) , we find at once that 


,a>) = 


J 101 


Ak 2 S (0 H(o)J 


1 -s 2 


where we have used the relation cos 9 = ^1 — s 2 . 


(4) 
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Next we take the two-dimensional spatial Fourier transform of Eq. (4) and neglect the 
contribution from the domain kl > 1, which is associated with evanescent waves whose 
contributions are negligible unless p i<)! varies rapidly on a scale of the order of a wave¬ 
length. We then obtain for p,® the expression 


, 10 ) 


(p'. Oi) 


jiO) 


Ak'-S&fa)) J [ 

slsl ’ 


fjb 


e^i-p'd 2 iks\ 


(5) 


Now since p (O) {0, to) = 1 one has from Eq. (5) 

jm 


] 


Ak 2 S {0> (a) J 

<1 » 




(ks 


( 6 ) 


From Eqs. (5) and (6) it follows at once that the spectral degree of coherence of the source 
may be expressed in the form 


p. (0) (p',w) 


Hp^) 

F(0,co) ’ 


(7) 


where 




I 


~iks , 




,p ' d 2 (A'S,). 


( 8 ) 


The integral on the right may be evaluated in closed form and one finds that (M&W, p. 248) 


fi < 0 >(p.w) 


sin (kp) 

kp 


(9) 


This formula implies that all quasi-homogeneous planar secondary Lambertian sources 
for which the spectrum is the same at each source point have the same spectral degree of 
coherence, given by Eq. (9). It is plotted in Fig. 5.15 as a function of the normalized dis¬ 
tance kp' = k\p 2 ~ pi| between two arbitrary source points. We see that the correlation 
distance A p' c of the light in the source plane is given by the order of magnitude expression 
(kAp' c ) ~ 7t/2 or, since k = 2 tt/\, 


A p' c ® A/4. (10) 

Thus we have shown that Lambertian sources are not spatially completely incoherent but 
are correlated over distances of the order of the wavelength. 
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The normalized far-zone spectrum is, therefore. 


s ,=c, (rs,&>) 


SW(rs,o>) 
r S^(rs,a})da) 

J 0 

k 2 S i0) (w)^ m (ks .a) 

r k 2 S m (fi))P- m (ks ,a>)dw 

JO ~~ 


and the normalized source spectrum is 


s (0) (rs. w) 


S' 0l (w) 

f S'°'(co)dw 
J o 


(3) 


(4) 


It is clear from a comparison of Eqs. (3) and (4) that, in general, the two normalized 
spectra will differ from each other, the difference being due to the spatial-coherence prop¬ 
erties of the source, which are characterized by the spectral degree of coherence, // (Ji . 

Examples of such “correlation-induced” spectral changes are given in Fig. 5.16, for radi¬ 
ation from a source with a Gaussian spectral profile. Figure 5.16 shows the differences in 
the far-zone spectrum in different directions of observation. We see that with increasing 
angle of observation from the normal to the source plane the spectral line undergoes a red- 
shift, i.e. a shift towards longer wavelengths (lower frequencies) Other types of changes 
may be generated with different spatial-coherence properties of the source as is illustrated, 
for example, by Fig. 5.17. 

It is clear from Eq. (3) that the normalized far-zone spectrum wall be independent of the 
direction s of observation if /7(As x , tu) factorizes in the form 


fi {0) (.ks ,a>) = F(o))H(s ). 


(5) 


The expression (3) then reduces to 


s (x Hrs,a>) 


k 2 S< 0 Hcj)F{a) 

k 2 S {0) (u))F(a}) dtu 
J o 


(6) 


The factorization condition (5) has some interesting consequences, if w e assume that it 
holds not only for the domain |sjJ 1 (which is a consequence of the fact that s is a unit 
vector) but also for all two-dimensional vectors s. Ev idently this will approximately be so. 
if at each frequency cu which is present in the source spectrum the spectral degree of coher¬ 
ence /t (0, (p\ oj) is effectively bandlimited to a circle of radius k about the origin: or. in more 
physical terms, if p {0) (p\ w) does not vary appreciably over distances of the order of the 
wavelength A = 2nd oj. Assuming this to be the case, we obtain, on taking the Fourier 
transform of Eq. (5), 
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Fig. 5.16 Illustrating the effect of spatial coherence of a planar, secondary source on the 
normalized far-zone spectrum of the emitted light. The normalized source spectrum 
is a line with a Gaussian profile, with a s /io 0 = 1/20 (a> 0 is the central frequency; a s is the 
r.m.s. width). The specttal degree of coherence was assumed to have a Gaussian spatial pro¬ 
file, with r.m.s. width er^ = IOAq, Ao being the corresponding wavelength. The normalized 
far-zone spectrum is plotted (a) on axis, (b) at 8 = 2° and (c) at 6 = 30°, [Adapted from Z. 
Dacic and E. Wolf, J. Opt. Soc. Amen A5 (1988). 1118-1126.] 


/t (0 Vp'. a>) = k 2 F(a))H(kp'). (7) 

where H is, of course, the Fourier transform of H. Since //"'(p . w) is a correlation coefficient, 

/j (0) (0. to) = 1 for all go. (8) 

Hence Eq. (7) implies that 

1 


k 2 F(eo) = 


H( 0) 


(9) 


Because the left-hand side of Eq. (9) depends on the frequency but the right-hand side is 
independent of it, it follows that 


fM - 


( 10 ) 


where a is a constant. 

Two important conclusions follow at once from these results. If we substitute from Eq. 
(10) into Eq. (6), we obtain the following expression for the normalized far-held spectrum: 

m, , S , 0 >(w) 

s ,u ’(rs.w) = —--. (11) 

S (0> (aj)dco 










S'*>(,(9. £0) Wk B T) 



Fig. 5,17 The far-zone spectra in different directions of observation, produced by a pla¬ 
nar, quasi-homogeneous source, with Gaussian spatial profile, whose spectrum is given 
by Planck’s law, (a) for the case when the spectral degree of coherence of the source is 
a Gaussian function, (b) The frequency a> m is the frequency at which the Planck spec¬ 
trum, indicated by dotted line, has its maximum. [Adapted from E, Wolf. Appl. Phys. 
B60 (1995), 303-308.] 


105 














106 


Sources of different states of coherence 


This formula shows not only that the normalized spectrum of the light is now the same 
throughout the far zone, i.e. it is independent of the direction of observation, but also that 
it is equal to the normalized source spectrum. 

When the condition (10) is satisfied and, consequently, when the normalized far-zone 
spectrum is the same as the source spectrum, the spectral degree of coherence necessarily 
has a certain functional form. This conclusion follows at once on substituting from Eq. (10) 
into Eq. (7) and setting aH(kp') = h(kp'). We then obtain for the spectral degree of coher¬ 
ence of the source the expression 

p (0) (p'.o)) = h(kp), (12) 

where, as before, p' = p 2 — p t . For obvious reasons this formula is known as the scaling 
law. According to the preceding analysis, any planar, secondary source whose spectrum is 
the same at each source point and which obeys the scaling law [Eq. (12)] will generate 
light whose normalized spectrum will be the same throughout the far zone and will be 
equal to the normalized source spectrum . 1 2 

We have learned earlier that the spectral degree of coherence of a planar, quasi-homoge- 
neous secondary Lambertian source whose spectrum is the same at each source point is 
[Eq. (9), Section 5.5] 


P !0, (P%0 


sinikp') 

kp 


(13) 


i.e. a Lambertian source satisfies the scaling law. The fact that mam laboratory' sources and 
also many sources encountered in nature are Lambertian may explain why “spectral invari¬ 
ance” of light on propagation has for so long been generally , but incorrectly, taken for granted. 

We have considered only spatial invariance of the spectrum throughout the far zone. 
Elowever, a more general result may also be established, namely that the normalized spec¬ 
trum of the field generated by any quasi-homogeneous scaling-law source is, to a good 
approximation, the same throughout the half-space into which the source radiates, except 
perhaps at points at distances of the order of a wavelength or less from the source plane.- 

The prediction that the spectrum of light may change on propagation, even in free space, 
was tested experimentally. The first tests were carried out with two different sources, both of 
thermal origin. In the first tests a tungsten lamp was located in front of an aperture in plane I 
[see Fig. 5.18(a)] and the light which passed through the aperture produced a secondary 
source in plane II, after it had passed through an optical system. Spectra in both these planes 
and also in a plane III located in the far zone of the secondary source were measured. 

The first optical system was an ordinary lens [Fig. 5.18(a)], The secondary source in 
plane II can be shown to have a spectral degree of coherence which obeys the scaling law. 
The second system [Fig. 5.18(b)] was a Fourier achromat consisting of a combination of 


1 We established the condition (12) as a sufficiency condition for "spectral invariance." The condition is also a 
necessary condition [E. Wolf, J. Mod. Opt. 39 (1992). 9-20. Theorem 11, p. 19], 

2 H. Roychowdhury and E. Wolf. Opt. Commun. 215 (2003). 199-203. 




Normalized spectral intensity 




Fig. 5.18 Two systems used to illustrate the validity of the scaling law for a planar, sec¬ 
ondary. quasi-homogeneous sources. [Adapted from G.M. Morris and D. Faklis. Opt. 
Commun. 62 (1987), 5-1 l.J 




(a) 


Frequency uj GO 15 s ') (b) 


Frequency w (10 15 s ') 


Fig. 5.19 (a) Observed spectra when the scaling law is satisfied, (b) Observed spectra 
when the scaling law is not satisfied. [Adapted from G. M. Morris and D. Faklis, Opt. 
Commun. 62 (1987), 5-11.] 
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optical elements designed for the use in white-light processing. One can show 1 that with this 
system the light in plane II does not obey the scaling law'; in fact it is effectively independ¬ 
ent of the frequency over the whole frequency range for which the lens has been achroma¬ 
tized. According to the theory which we outlined, one must expect that in the first case the 
normalized spectrum of the light in plane III will be the same in all directions of observa¬ 
tion 9 , whereas in the second case it will depend on 9. 

Figure 5.19 shows the result of the experiments. Figure 5.19(a) shows the measured spec¬ 
trum at all points in the far zone with the first system (a conventional lens). Since, as we have 
already noted, the scaling law is satisfied in this case, the theory' predicts that the normalized 
spectrum of the light is the same for all 9 directions and is the same as the normalized spec¬ 
trum of the secondary source. This is indeed what the experiments demonstrated. In Fig. 
5.19(b) the measured far-zone spectra obtained in the experiment which used the Fourier 
achromat are shown. Since in this case the secondary source in plane II does not obey the 
scaling law, the normalized spectra observed in different directions should depend on 9. This 
was experimentally confirmed, with the results shown in Fig. 5.19(b). 


PROBLEMS 


5.1 A planar secondary source occupies a finite domain in the plane z = 0 and radiates into 
the half-space z > 0. 

(a) If the source fluctuations are characterized by an ensemble which is stationary, 
show that the radiant intensity of the field produced by the source may be 
expressed in the form 


J a ( s) = k 2 AC(ks . w)cos 2 (9. 


where C(K, oj) is the Fourier transform of the so-called source-averaged correla¬ 
tion function C(r', o>), defined by the formula 



where A denotes the area of the source. 

(b) Use the formula of part (a) to derive expressions for the radiant intensity when the 
source is (i) of Schell-model type; and (ii) quasi-homogeneous. 

5.2 A planar, secondary, quasi-homogeneous source <r of finite extent radiates into the 
half-space z > 0. The total radiated flux at frequency ai is given by the expression 


F = f J (s)dft, 
“ Ja w 


where J m ( s) is the radiant intensity generated by the source and the integration extends 
over the solid angle of 27 t subtended at the source by the hemisphere at infinity in the 
half-space z > 0. Show that 
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F W < f S l0) ( r, co)d 2 r, 


where S , 0 \r, to) is the distribution of the spectral density across the source. 

5.3 Obtain conditions under which two planar, secondary. Gaussian Schell-model sources 
will generate fields that have the same spectral degree of coherence throughout the far 
zone. 

Derive also expressions for the corresponding radiant intensities produced by two such 
sources. 

5.4 A three-dimensional spatially incoherent primary' source occupies a finite domain D. 
Its fluctuations are statistically stationary and the spectrum of the source is the same at 
every point. 

(a) Show that the normalized spectrum of the field is the same at every point outside 
the source domain D. 

(b) Derive an expression for the spectral degree of coherence in the far zone. 

5.5 (a) Find a sufficiency condition under which two three-dimensional statistically sta¬ 

tionary, primary, quasi-homogeneous source distributions Q(r, to) generate fields 
which have the same far-zone spectra. 

(b) Give an example of two such sources. 

(c) Consider also the special case of (a) when the sources obey the scaling law, i.e. 
when the spectral degree of coherence of each source has the functional form 


Pg(r’, to) = h(kr'), (k = told). 


5.6 Consider a statistically stationary three-dimensional source occupying a finite volume. 

(a) Derive an expression for the radiant intensity in terms of the coherent modes of the 
source. 

(b) If the integral of the radiant intensity over all directions (i.e. directions that fill the 
complete 4 tt solid angle) has zero value, the source may be said to be non-radiat¬ 
ing. Show that, if the source is non-radiating, then all of its source modes are also 
non-radiating. 

5.7 Show that the radiant intensity generated by a fluctuating three-dimensional source 
distribution Q( r, t) is given by the expression 


JJs) = (2TT) 6 W Q (-ks, ks, to). 


where Wq (Kj, K 2 , to) is the six-dimensional spatial Fourier transform of the cross- 
spectral density function W Q (r u r 2 , to) = w)Q(r 2 , to)) of the source, i.e. 



WqCKj.IL,, u) 
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Show also that when the source is spatially completely incoherent the radiant intensity 
is independent of direction. 

5.8 Consider a completely coherent, statistically stationary, three-dimensional source dis¬ 
tribution Q(r, 1 ) occupying a finite volume. The cross-spectral density of such a source 
has the factorized form 


W Q { r,\ r' t to) = G*(r', co)G( r', to). 

(a) With the help of the result stated in the previous problem, find an expression for 
the radiant intensity of the field generated by the source. 

(b) Suppose that the source is uniform, equi-phasal and spherical. Show that for cer¬ 
tain values of its radius the source will not radiate. 
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Coherence effects in scattering 


When light is incident on an object, it deviates from its original path, i.e. it is scattered by 
the object. There are many different types of scattering, for example, scattering by atoms 
or molecules, by dust particles, or by macroscopic bodies. The bodies may be homoge¬ 
neous or inhomogeneous, isotropic or anisotropic and their behavior may be changing in 
time. One then speaks of static or dynamic scattering, respectively. The response of the 
scatterer may be linear or non-linear and the medium may be deterministic or random. 

It is clear from these remarks that light scattering is a very broad subject. In this chapter 
we will consider only one class of scattering process, albeit a rather broad one: namely 
scattering on a linear, isotropic, statistically stationary medium. We will discuss both deter¬ 
ministic and stochastic fields incident on a static scattering medium, both when the medium 
is deterministic and when it is stochastic. 


6.1 Scattering of a monochromatic plane wave on a 
deterministic medium 


Let us first consider the scattering of a monochromatic wave 

V%r, t) = C'Hr, «)e~ lw '. 


( 1 ) 


incident upon a linear scatterer. which occupies a finite domain D in free space (Fig. 6.1). 
Here r denotes the position vector of any point either outside or inside the scatterer, t 
denotes the time and to denotes the frequency. We assume that the physical properties of the 
medium are characterized by a refractive index «(r, to). 


Let 


V(r, t) = U{ r, 


( 2 ) 


be the total field at a point r U( r, to) then satisfies the equation 
V 2 (/(r, to) + k 2 n 2 ( r, w)U( r, o>) = 0, 


(3) 


where k is the free-space wave number associated with frequency to, i.e. 


k = tu/c, 


(4) 
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incident wave £7® 



scattered wave U (s l 


Fig. 6.1 Illustrating the notation relating to scattering. 


c being the speed of light in vacuum. It is convenient to rewrite Eq. (3) in the form 

V 2 Mr, to) + k 2 U) r. w) = -4~F(r. to)Uir. to). (5) 

where the quantity 


F(r.io) 


— - 1 ] 

4tt 


( 6 ) 


is called the scattering potential of the medium. Because of the relation 

n 2 (r, to) = 1 + 4-t)(r. to) (7) 

between the refractive index «(r, to) and the dielectric susceptibility r/(r, to), the scattering 
potential may be expressed in the simple form 

F(r, to) = k 2 t)( r. to). (8) 

Let us represent the field U( r, to) as the sum of the incident field £/®{r, to) and the scat¬ 
tered field [f s> ( r, to), viz., 

Mr, oj) = lf l> ( r, to) + M s ’(r, w). (9) 

This relation may, in fact, be regarded as defining the scattered field M s) (r, to). The incident 
field will be assumed to satisfy the Helmholtz equation 

(V 2 + A 2 )M"(r, w) = 0 (10) 


throughout all space. 
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By the use of well-known vector identities one can show from Eqs. (5) and (10), when 
Eq. (9) is also used, that, with the scattered field f/ s) (r, o>) assumed to behave at infinity as 
an outgoing spherical wave, the total field obeys the equation (cf. B&W, Section 3.1.1) 

U(r.oj) = U a Hr.co) + F(r', to) U(r’, w)G{|r - r'|,£t>)dV, (11) 

where G(R, to) (R = |r — r'|) is the outgoing free-space Green function 

G(R. oj ) = e kK !R (12) 

of the Helmholtz operator. Equation (11) together with Eq. (9) is the basic integral equation 
for the scattered field, for scattering of a monochromatic wave on a medium characterized 
by the scattering potential F(r, to). It is generally known as the integral equation of poten¬ 
tial scattering. 

In general, it is not possible to solve Eq. (11) in closed form. However, when the scatter¬ 
ing is weak, a relatively simple approximate closed-form solution of Eq. (11) can readily 
be deduced. By weak scattering we mean that the magnitude |f/ s, | of the scattered field is 
much smaller than the magnitude 11/ 1 ’! of the incident field, i.e. that 

|f/ s \r, a>)\ < |t/ ti5 (r, o>)| (13) 

throughout the scatterer. It is clear from Eqs. (6) and (11) that the scattering will be weak 
if the refractive index differs only slightly from unity. In such a case, one can, to a good 
approximation, replace the total field U in the integral in Eq. (11) by the incident field lf l \ 
The integral equation (11) then becomes 

U(r.to) = U (,] (r.to) + f F(r', a>)t/ (1) ^r , .w)G(|r — r , |, told 3 /. (14) 

D 

which is known as the first-order Born approximation to the solution of the integral equa¬ 
tion of potential scattering. We note that, unlike Eq. (11). Eq. (14) is not an integral equa¬ 
tion but is, in fact, a solution to the scattering problem in terms of the incident field If" and 
the scattering potential F(r, to). 

It is of interest to note that within the accuracy of the first-order Born approximation 
Eq. (5) reduces to 

V 2 U(r, to) + k 2 U( r, to) = —47rF(r, to)lfi®{T, to). (15) 

This equation is identical to the equation for radiation from a scalar source distribution 1 

p(r. to) = F(r, tu)f/"(r, to), (16) 

occupying the volume region D. Hence within the accuracy of the first-order Born approx¬ 
imation the process of scattering on a static, linear medium and the process of radiation 
from a localized source distribution are mathematically equivalent to each other. 

1 See. for example, C. H. Papas. Theon of Electromagnetic Wave Propagation (McGraw-Hill. New York. 1965). 
Eq. (56), p. 11. 



114 


Coherence effects in scattering 


P(r) 



Fig. 6.2 Notation relating to derivation of the far-zone approximation (19) of Section 
6.1 to the free-space Green function G(|r - r'|, w). 


For later purposes we derive from Eq. (14) an expression for the scattered field in the far 
zone of the scattering medium, assuming that the wave incident on the scatterer is a mono¬ 
chromatic plane wave of frequency w, which propagates in the direction of a real unit vec¬ 
tor s n : 


U U) (r,a>) =«(w)e lfai o' r . (17) 

Under these circumstances Eq. (14) becomes 

U(r,<a) = a(o))e ifai(, ' r + a(w) f F(r', a>)e ifc V r 'G(jr - r'| <w)dV, (18) 


where Green's function GiR. u>) = G( r — r' |, w) may now he approximated by the 
expression 


G(|r - r'j, w)~ 



(19) 


The approximation given in Eq. (19) is esident from Fig. 6.2, where r = OP and 
l r _ r 1 ~ QP ~ OP — ON « r — s • r', /V being the foot of the perpendicular dropped 
from a point Q( r') onto the line OP. 

If we make use of Eq. (19) in Eq. (14) and use Eq. (17) the field in the far zone is seen to 
be given by the expression 


U(rs,u>) 


a(0L>) 




+ A(s. oj) 



( 20 ) 


where 


A(s,w) = f F(r',co)e ^ s - s o^ r d 3 r'. 

•J D 


( 21 ) 


In mathematical language, Eq. (20) represents the asymptotic approximation to the far field 
as kr^>cc in the fixed direction s. The amplitude function A(s, o>) of the spherical wave on 
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the right-hand side of Eq. (20) is known as the scattering amplitude. If we introduce the 
Fourier transform F(K, a>) of the scattering potential. 

F(K.w) = f F( r',w)e lK ' r dV, (22) 

•J D 


then Eq. (21) implies that 

A(s.cu) = F[k(s — s 0 ).tu]. (23) 

Equation (23) shows that, within the accuracy of the first-order Bom approximation, the 
scattering amplitude A(s, to) is equal to a component of the three-dimensional spatial 
Fourier transform of the scattering potential, namely the component labeled by the spatial- 
frequency vector K = k(s — s () ), s n being the unit vector along the direction of incidence. 

6.2 Scattering of partially coherent waves on a deterministic medium 

We will now consider the more complicated situation when the light incident on a deter¬ 
ministic scatterer is not monochromatic but is partially coherent. The fluctuations of the 
light are assumed to be statistically stationary, at least in the wide sense. 

Let r 2 , «) be the cross-spectral density function of the incident light. Then as we 

learned earlier, W 1 may be expressed in the form [Eq. (13) of Section 4.1 ] 

W (l) (r r r,,w) = (U Ui *(r l , «)t/ (l) (r,, w)}, (1) 

the angular brackets denoting the average over the statistical ensemble of monochromatic 
realizations of the incident field. 

The scattered field may likewise be represented by an ensemble of monochromatic real¬ 
izations, lf s) (r, o>), whose cross-spectral density function may be represented in a similar 
form, viz.. 


W (s) (r,, r 2 ,oj) = (£/ (s) *(r,, w){/ fs) (r 2 ,w)). ? 

The scattered field f/ sl and the incident field If" are related by the integral equation of 
potential scattering. Within the accuracy of the first-order Born approximation the scattered 
field If" is given by the integral on the right-hand side of Eq. (14) of Section 6.1. Using 
that relation and Eqs. (1) and (2) it follows that 

W { ^l(rpr,,w) = J J VW’fr/,r',w)F*(r',wlFfr,,w) 

X G*(|r, - r'|, (o)G( Jr 2 - r 2 |» w)d 3 r 1 'd 3 r 2 . (3) 

To obtain a clearer insight into some consequences of Eq. (3). let us express the cross- 
spectral density W’fr,, r 2 , a>) in terms of the spectral densities S^rf, w) and S (l) (r 2 , w) of 
the incident light and of its spectral degree of coherence [Eq. (6b) of Section 4.2], viz.. 
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W (i V/.r',w) = r/, at) r', w) r', &>). (4) 

Equation (3) for the cross-spectral density of the scattered field then becomes 

X G*(|r, - rj|,w)GC|r 2 - r^wldV/d 3 r 2 . (5) 

If, as is frequently the case, the spectrum of the fight incident on the scatterer is independ¬ 
ent of position, then we may set S^Crf, oj) = «) = in Eq, (5), which then 

becomes 

a) = 5 (1 ’(tu) f f /u (I) (r'r^.aj)F*(r,\o))F(r',aj) 

^ J D J D ~ 

X G*(tr, - r'|,ai)G(|r 2 , r'|, w )d 3 r/ d % (6) 

In particular, if the two field points coincide, i.e. if r, = r 2 = r say, then the left-hand side 
of Eq. (6) will represent the spectrum of the scattered field, 5 (s) (r, o>), and the equation 
reduces to 

S (s) (r,tn) = S (1, («) f f Lta)F (r'.adElr,'. to) 

X G (|r - rj f |, w)G(|r - r : '],tu)d 3 ;-' d 3 r 2 ' (7) 

This formula shows that, even on static scattering, the spectrum of the scattered field dif¬ 
fers, in general, from the spectrum of the incident field, the change arising from (1) the spa¬ 
tial coherence properties of the incident light, (2 ) the frequency dependence of the potential 
(due to dispersion of the medium) and (3) the frequency dependence of the free-space 
Green function. Usually, with narrow-band incident light, the change will be caused mainly 
by the spatial-coherence properties of the incident light, which are characterized by the 
spectral degree of coherence /i (1) (r|f, r 2 , oj). 

We noted earlier [see the remark that follows Eq. (15) of Section 6.1] that, within the 
accuracy of the first Born approximation, the processes of radiation and of scattering are 
equivalent to each other. We also learned in Section 5.6 that the spectrum of a field radiated 
by a partially coherent source may differ from the spectrum of the source. Hence the result 
that we just derived, namely that the spectrum of the light scattered on a time- 
independent medium (i.e. for static scattering) will, in general, differ from the spectrum of 
the incident light, was to be expected. 

We will illustrate some of the preceding results by a simple example. Suppose that a 
polychromatic plane wave, propagating in the direction of a unit vector s 0 , is incident on the 
scatterer. We may represent the wave by a statistical ensemble (denoted by curly brackets) 


{G (i, (r.w)} = {a(w)}e iU «‘ r 


(8) 
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a(a>) being a random amplitude. On substituting from Eq. (8) into Eq. (1) we obtain for the 
cross-spectral density function of the wave incident on the scatlerer the expression 

W (i) ( ri ,r 2 ,w) = 5 (i) (w)e iis o‘ (r 2“ r '\ (9) 

where 

S (1, («) = (a (oj)a(oj)) (10) 

represents the spectrum of the incident wave. It follows at once from Eq. (9) and from 
Eq. (4) that the spectral degree of coherence of the incident wave is given by the expression 

/i (l) (r r r,. w) = e lH o' (r :~ r i\ (11) 

Since in this case |^‘ , (r], r 2 , w)| = 1 for all pairs of points rj and r 2 . the incident wave is 

spatially completely coherent at the frequency to throughout all space. 

It follows from Eqs. (5) and (11) and from the fact that the spectrum of the incident light 
is now independent of position that the cross-spectral density of the scattered light is, 
within the accuracy of the first-order Born approximation, given by the expression 

W ,s) (r,. r 2 , oj) = (t/ s) *(r,. tu)f/% 2 , w)), (12) 

where 

t/ ,sl (r, w) = f F(r',a>)G(lr - r'|, w)e iis «' r ' d-V. (13) 

Since the cross-spectral density W* sl (r] r 2 , oj) is now the product of a function of r, and a func¬ 
tion of r 2 , it follows at once that the spectral degree of coherence /r ls, (r , . r 2 . oj) of the scattered 
field, which may be defined by a formula of the form of Eq. (4), is unitnodular. implying that 
the scattered field is also spatially completely coherent at frequency oj. The fact that in the pres¬ 
ent case the state of coherence has not changed on scattering is rather exceptional. In general 
not only the spectrum of the light but also its state of coherence will change on scattering. 

One is frequently interested only in the behavior of the scattered field in the far zone. 
One may readily derive an expression for t/ sl (r, oj) by making use of the far-zone approx¬ 
imation [Eq. (19) of Section 6.1] for the free-space Green function. Equation (13) then 
becomes 


U (i Hrs,oj) = yJs (i Hoj) — f F(r',o>)e~ ik(s ~ So) ' r dV 
r J D 

= Vs (i) M>[*(s - s 0 ),w] —, (14) 

r 

where F(K, to) is the three-dimensional Fourier transform of the scattering potential, defined 
by Eq. (22) of Section 6.1. From Eqs. (14) and (12) it follows that the cross-spectral 
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density of the scattered field and the spectral density in the far zone are given by the 
expressions 


= — S E,) (M)F*[fc(s 1 - s Q ), co]F[k(s 2 - s 0 ), w] 


(15) 


and 


5 (s) (rs, w) = W (s) (rs, rs. w) 


\ s^^mkis 

r L ' 



(16) 


The preceding formulas refer to coherence properties in the space-frequency domain. Let 
us now briefly consider coherence properties in the space-time domain. As we learned ear¬ 
lier, these properties are characterized by the mutual coherence function T(ri, r 2 , r), which 
is the Fourier frequency transform of the cross-spectral density [Eq. (2) of Section 4.1], In 
the present case, when the wave incident on the scatterer is a polychromatic plane wave, its 
cross-spectral density function is given by Eq. (9) and hence 

r fD (rj,r 2 ,r) = S©( w )e ii VC r 2 - r l l e ~ iMr dw 

= f o ^ S«(w)e _i “ [T ‘ s o^ ^ c] dw. (17) 


The degree of coherence 7 ®^, r 2 , r) of the wave is obtained by normalizing Eq (17) in 
accordance with Eq. (10) of Section 3.1 and one finds that 


7®(r r,,T) = 0 


f 5®(w)e _i<u[T_5 » r -- r , > ' c l d.w 
Jo 


f S 0, (w)doj 
Jo 


(18) 


The mutual coherence function of the scattered field in the far zone is obtained by taking 
the Fourier transform of Eq. (15), which gives 

r ts) (rs,,rs 1 ,r) = — f S 0) (co) F“[k( s. - s 0 ), &»] F[k(s 2 - s 0 ) r w]e H “ r dio. (19) 

y2- \J 0 

An expression for the degree of coherence 7 (s) (rs b rs 2 , r) of the scattered field is obtained 
by normalizing Eq. (19) in accordance with Eq. (10) of Section 3.1, as above. 

6.3 Scattering on random media 

6.3.1 General formulas 

So far we have assumed that the scatterer is deterministic. The scattering potential F( r, «) is 
then a well-defined function of position. Frequently, however, this is not so; the scattering 
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potential is then a random function of position. An example of such a situation is the turbu¬ 
lent atmosphere in which the refractive index varies randomly both in time and in space, due 
to irregular fluctuations in temperature and pressure. Over sufficiently short intervals of 
time, typically of the order of a tenth of a second, the temporal fluctuations may be neglected 
and one then speaks of a “frozen model” of the turbulent atmosphere. Under these circum¬ 
stances one is essentially dealing with static scattering, to which our considerations apply. 
For such situations an expression for the cross-spectral density of the scattered field that 
is valid to within the accuracy of the first-order Bom approximation is obtained by averag¬ 
ing Eqs. (3) and (5) of the preceding section over the ensemble of the scattering medium. 
We will denote this average by angular brackets with subscript m (i.e. we will write it 
as {. . We then obtain, from Eq. (5) of the preceding section, the formula 

(r,, r 2 ,«) = f f yj S (il (r,'. w) ^5" 1 (r 2 . (r,. r 2 . w) C F (r', r,', oj) 

J D*J D 

X G* (Jfj - 4 w)G(|r a - r'|, oi) d-V/ d-V 2 ', (1) 


where 


C F { r/.r',m) = {F\r[^)F{r^w)) m ( 2 ) 

is the correlation function of the scattering potential. It should be noted that Eq. (1) con¬ 
tains implicitly two different averages, namely one taken over the ensemble of the incident 
field and the other over the ensemble of the scatterer. Since we have assumed that the scat¬ 
tering is weak fin the sense of the first-order Bom approximation) we can treat the two 
averaging processes independently of each other. 

The spectral density of the scattered field is obtained at once from Eq. (1) by setting 
r, = r 2 = r and we then find that 

SW(r,w)= f f yjsto (tf. to) (r', cu),# (r', r', w) C F (r/, r,', w) 

J dJ D 

X G’(|r - r'|, «)G(|r - r 2 '|, w)d 3 r/ d 3 r 2 '. (3) 

Although Eqs. (2) and (3) involve averages over macroscopically similar but microscop¬ 
ically different scatterers, one can often deduce the values of these averages, at least to a 
good approximation, from experiments involving only a single scatterer. For example, the 
necessarily finite size of the detector aperture will frequently provide spatial averaging, 
which is essentially equivalent to ensemble averaging. 1 


See L.G. Shirley and N. George, Appl. Opt. 27 (1988), 1850-1861, Section II, and J. Goodman^ contribution in 
Laser Speckle and Related Phenomena, second edition, J.C. Dainty ed. (Springer, New York, 1984), Section 2.6.1. 
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Expressions for the cross-spectral density and for the spectrum of the scattered field in 
the far zone may be obtained at once from Eqs. (1) and (3) by again using the far-zone 
approximation (19) of Section 6.1 for the free-space Green function. Equations (1) and (3) 
then give 

W^HrSp/is^w) = I f ^S {i > (r' 2 ,w)p* (r/, r' 2 ,to)C F ( r/,r', w) 

r~ J d J d 

X dV/ d 3 r 2 ' (4) 


and 


5’ <s) (rs,.w) -y f f Js (i) ( r/, w) yjs (i) (r 2 . a> )p (i) (r/, fi.a>)C r (r,'.r(,oj) 

r~ J dJ d 

X e Hi! >’£ r 2 ~ r i') d 3 r/d 3 r 2 ' (5) 

as kr —> oo, with the directions s b s 2 and s being kept fixed. An expression for the spectral 
degree of coherence of the scattered field in the far zone is obtained on substituting from 
Eqs. (4) and (5) into the formula (6b) of Section 4.2 which defines the spectral degree of 
coherence, viz.. 


p ,s) (rs 1 ,rs-,,w) 


VV' (si (rs | .rs-,.£u) 
^5 ,s) (rs,.w) ^S ,s, (rs 2 .oj) 


( 6 ) 


We have assumed so far that the scatterer is a continuous medium. The analysis may 
readily be extended to scattering by a system of particles, a case of considerable practical 
interest. If the scattering potential of each particle is the same, say fir, w) and the particles 

are located at points with position vectors rj, r : .the scattering potential of the whole 

system of particles is 


T(r,w) = J2f (r ~ r n' w) - C 7 ) 

n 

The correlation function of this scattering potential is evidently 

C f (rj,r 2 ,w) = (F*(r v w)F(r 2 ,ta)) = i “ _ r ,,»“)). ( 8 ) 

m n 

where the expectation value is taken over the ensemble of the particles. 

The formulas that we have derived in this section and in the preceding one may be used 
to elucidate many features of the scattering of the light of any state of coherence, on a deter¬ 
ministic or on a random medium. We will now illustrate them by some examples. 
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6.3.2 Examples 

When the incident field is a spatially coherent, polychromatic plane wave, its cross-spectral 
density is given by Eq. (9) of Section 6.2. On substituting that expression into formula (4). 
we obtain for the cross-spectral density function of the scattered w ave in the far zone the 
expression 


W fe J(/Ts,,rs 2 ,w) ~ 4- 5<il («) f f e ita »- <r -~ r /» c f ( r'.^wle^^'^V^d-V/d-V' 

J nJ n 


- S (| ). fc(S, - S„), 


W|, 


(9) 


where 


C F <K r K,,w) = f D f D C F ( r'r;, W )e- lK r r r K - [! dV;dV; (10) 

is the six-dimensional spatial Fourier transform of the correlation function Q (r(, rL w) of 
the scattering potential. 

The spectrum of the scattered field in the far zone of the scatterer is obtained at once 
from Eq. (9) on setting Sj = s 2 = s. One then finds that 

S^(rs.oj) -— S (i) (co)C F [-fc(s - s„), k(s - s„), w], (II) 

r- 

Suppose that the scatterer is a Gaussian-correlated, homogeneous isotropic medium. Its 
correlation function then has the form 


C f (r 15 r 2 ,w) 



e -jr,-r, | 2 /(2 <t 2 ) 


( 12 ) 


where A and a are positive constants. We assume that a is small compared with the linear 
dimensions of the scatterer. Using Eqs. (10) — (12) one finds after a straightforward calcula¬ 
tion that the spectral density of the scattered field in the far zone is given by the expression 

AV , , 

S (i) (rs.a>) = — S®{a))e- 2(k<!) ***'««), (13) 

r 2 


where V is the scattering volume and 9 is the angle of scattering (s • s 0 = cos 9). 

Let us compare this situation with the case when the incident light is ambient, rather than 
spatially coherent. The cross-spectral density function of the incident field has the form 


W (i >(r,,r 2 ,w) = £&(«) 


sinq-|r 2 - r,|) 


(14) 
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Fig. 6.3 The ratio f(6: ka) = [S (s) (a>)] coh /[S ,<s) ] amb of the spectra of a spatially coherent 
plane light wave and of ambient light, scattered on a Gaussian-correlated isotropic 
medium of r.m.s. width a. calculated to within the accuracy of the first Bom approxima¬ 
tion. [Adapted from J. Jannson, T. Jannson and E. Wolf, Opt. Lett. 13 ( 1 988), 1060-1062.] 


rather than the form given by Eq. (9) of Section 6.2. In this case. Eq. (5) may be shown, 
after some calculations, to give 



S©(«)[l - e” 2 '*' 7 * 2 


(15) 


One may readily verify that when ka < 1 (highly uncorrelated scatterer) Eqs. (15) and 
(13) become identical. However, when ka > 1, i.e. when the correlation length a of the 
scatterer is much greater than the reduced wavelength V(2tr) = 1 Ik = c/co of the incident 
light, the angular distributions of the scattered light in the far zone in the two cases are com¬ 
pletely different. This fact is illustrated in Fig. 6.3. 

On returning to Eq. (11) we see, as was to be expected from our earlier remarks relating 
to spectral changes produced on scattering and from the comment following Eq. (16) in 
Section 6.1, that the spectrum of the scattered field in the far zone will, in general, differ 
from the source spectrum. 1 

Effects of spectral coherence on the angular distribution of the scattered light were stud¬ 
ied experimentally 2 and confirmed some of the main features relating to effects of the state 
of coherence of the incident light on the angular distribution of the scattered radiation. 

As another example, let us consider the scattering of a spatially coherent polychromatic 
plane wave on a random distribution of identical particles. The spectral density of the scattered 

1 See, for example, E. W 7 olf, J. T. Foley and F. Gori,./. Opt. Soc. Amer. A6 (1989), 1142-1149; errata. Ibid. A7 
(1990), 173. 

2 F. Gori, C. Palma and M. Santersiero. Opt. Cotmmm. 74 (1990), 353 -356. 
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light in the far zone is given by Eq. (11), with C F defined by Eq. (10). The quantity C F in this 
case, is given by the formula 

C f (-K. K, w) = EE f D f D (f* (r l ~ V w ).Ar 2 - r „< w))e“ i[K ' (r '' r <' )1 d-V/ d 3 ^. 

m n 

(16) 

Let us introduce new variables R lm = r[ - r,„, R 2 „ = r 2 - r„. The formula (16) then becomes 
C F (- K, K, to) = (EE e ' lK ' ,r "' rJ 

\ m n 

X U D {f(R ^ W)f{R ^ ^R 2h , (17) 

which may be expressed in the compact form 

C F ( -K. K. co) = |/(K. w)| 1 2 5(K). (18) 

where / (K, or) is the Fourier transform of/(r. co) and 


5(K) 


E e_iK r "’ 


(19) 


is the so-called generalized structure function of the particle system.^ 

On substituting from Eq. (18) into Eq. (11) we obtain for the spectrum of the scattered 
field in the far zone the expression 

S (,i) (rs, co) = -^S©(w)|/[/r(s - s Q ), co] S[k (s - s Q )]. (20) 

This formula shows that the spectrum of the incident wave is changed by scattering on a 
system of particles, the change being caused by the scattering potential fir, co) oflhe indi¬ 
vidual particles and by the structure function 5(K) of the particle system. 

Some examples of the changes in the spectrum produced by particle scattering were dis¬ 
cussed by Dogariu and Wolf/ The possibility of utilizing spectral changes generated by scat¬ 
tering to determine density correlation functions in some particle systems was also noted. 3 

6.3.3 Scattering on a quasi-homogeneous medium 4 

An important class of random media is constituted by the so-called quasi-homogeneous 
media (also called locally homogeneous media). 5 To explain the basic properties of such 

1 This function is proportional to the structure factor, which plays an important role in the theory of disordered 
systems. [See. for example. J.M. Ziman, Models of Disorder (C ambridge University Press, Cambridge. 1979)]. 

2 A. Dogariu and E. Wolf. Opt. Lett. 23 (1998). 1340-1342. 

3 G. Gbur and E. Wolf. Opt. Common. 168 (1999), 39~45. 

4 The analysis in this section is largely based on the following papers: W. H. Carter and E. Wolf. Opt. Common, 
(si (1988). 85-90; D. G. Fischer and E. Wolf. J. Opt. Soc. Amei: All (1994). 1128-1135: and T. D. Visser. D. G. 
Fischer and E. Wolf, J. Opt. Soc. Amen A23 (2006). 1631-1638. 

5 Scattering from such media appears to have been first considered by R. A. Silverman, Proc. Cambridge Philos. 
Soc. 54(1958). 530-537. 
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media, we first introduce the normalized correlation coefficient of the scattering potential, 
defined by the formula 


M r i< r 2- 


_c f (r,, r 2 , Qj) _ 

-JC F ( r,, r,, a>) yjC F (r 2 , r 2 , <o) 


( 21 ) 


where C F (r 1? r 2 , u>) is the correlation function, defined by Eq, (2), of the scattering poten¬ 
tial F( r. u>). It is useful to introduce also the function 

I fir, at) = C F ( r, r, to) = (F*( r, oj)F( r, «)),„, (22) 

which is a measure of the strength of the scattering potential at the point r. 

A quasi-homogeneous medium is characterized by the properties that at each frequency o> 

(1) the normalized correlation coefficient Ff( r u Lp &>) depends on the two spatial variables 
Tj and r 2 only through the difference r 2 — iq. in which case we will write [i F (r 2 — rq, at) 
in place of /t F (iq, r 2 , w); and 

(2) the strength I F ( r, to) of the scattering potential varies much more slowly with r than 
11,r{r 2 ~ iq, (u) varies with the difference r' = r 2 — iq. Hence / F (r, w) remains nearly 
constant over distances for which |/i F (r', w)| has an appreciable value. 

These properties are sometimes expressed by saying that I fir. w) is a slow function of r 
and that Hf( r ', to) is a fast function of r'. Examples of media w ith such properties are the 
troposphere and confined plasmas. 

The conditions (1 i and (2) are strictly analogous to conditions that characterize quasi-homo¬ 
geneous sources, as we discussed in Section 5.3.2. We show ed there 1 that such sources and the 
radiation generated by them obey certain reciprocity relations. Because of the previously men¬ 
tioned analogy between radiation and scattering, w e can expect that somewhat similar results 
will apply in connection with scattering. We will now show that this is indeed the case. 

Let us rewrite Eq. (21) in the form 

C F (r,. r 2 , at) = Jl F ( r,, at) ^ / F (r ; . w)/4 F ( r,, r 2 , w). (23) 


Because of the properties which characterize quasi-homogeneous scatterers, C F may be 
approximated by the formula 


C/r(l*p Tj, to) Ip 


■ , to 


%(r 2 - f, at). 


(24) 


Suppose now' that the wave incident on such a medium is a polychromatic plane wave of 
spectral density 5(a>), propagating in a direction specified by a unit vector s 0 . According to 
Eq. (11) of Section 6.2, the spectral degree of coherence of such a wave has the simple form 


In Section 5.3.2 we considered planar quasi-homogeneous sources. However, it is not difficult to show that 
strictly similar results apply to three-dimensional sources of this class (see M&W. Section 5.8.2). 
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^ (l) (r p r 2 , w) = e u ' s °' (r ^ *i\ 


(25) 


On using Eq. (25) and the fact that the spectrum of such a wave is independent of position, 
the formula (9) for the cross-spectral density of the scattered field in the far zone applies. 
The Fourier transform which enters that formula is now given by the expression 


C f (K,. K 2 , w) — 


II 

J D*J D 


r,' + r. 


P F (r2 ~ r i w ) e 


-itK r r,'+K 2 -r') d 3 ' d 3 ' 


(26) 


The integrations on the right can be considerably simplified by changing the variables of 
integration. Let us set 


r 


>/ + r'). 



Then we have the inverse relations 




(27a) 


(27b) 


and Eq. (26) becomes, after some calculations, 

C f (K 1? K 2 , oj ) = f f / F ( r. «)/i F (r', w) e _i[K '''H r VK,-<r^ r '>i d 3 rd 3 r ^ (28) 


where we have used the fact that the Jacobian of the transformation (27) is unity, as may 
readily be verified. 

From Eq. (28) one finds that 

C f (K, . K 2 . id) = I F (K, + K ; , w)/i f f 1 - (K 2 - Kj). (» i. (29) 


where 

/ f (K, w) = f ! f ( r, w)e“ lK ' r dV (30a) 

and 

ft F ( K'. to) = f n F {r\ to)e” lK ' r ’ d-V', (30b) 

J D 

are the Fourier transforms of 1 F and fi F respectively. On substituting for C F from Eq. (29) 
into Eq. (9), we obtain the following expression for the cross-spectral density of the scat¬ 
tered field in the far zone of a quasi-homogeneous medium illuminated by a polychromatic 
plane wave: 


W^Hrs v rs 2 , w)--S a, (w) I F [k(s 2 - s,), to] fi F 


S 1 + S 2 


- S„ 


(31) 











126 


Coherence effects in scattering 


From Eq. (31) it follows that the spectral density is given by the formula 


S w (rs, co) = — S (il (o»/ F (0, io)fi F [k(s - s n ), co]. 


(32) 


where, according to Eq. (30a), 

I F {0, co) = J D (|E(r, w )P) m dV. (33) 

The spectral degree of coherence of the scattered field can also be readily determined 
from Eq. (31) and one finds that 


.08 ,.o - [f^ 8 ! _ V ^ 


^ <sl (rs,, rs.,, w) = 


I AO, co) 


G(Sj , s 2 ; Sg, co ), 


(34) 


with 


G(s p s 2 ; S 0 , co) 


fl F 

J S 1 +S 2 
2 

- s o 

, CO 

(35) 

s, - s 0 | co] ^ 

l jl F [k{s 2 - s 0 ), to] 


Because for a quasi-homogeneous scatterer /qtfr', co) is a fast function of its spatial argu¬ 
ment, it follows from a well-known reciprocity theorem concerning Fourier-transform 
pairs that ji F ( K. to) is a slow function of K. Under these circumstances 


fi F \k(s l - s 0 ), oj] = UAH s 2 


- s 0 ), co] » fi F 


k ( s i + s 2 ) 

2 



(36) 


Consequently Eq. (34) simplifies to 


p (s) (rs l , rs,, co) 


I F [k(s 2 - Sj), w] 

I F (0.u>) 


(37) 


Equations (32) and (37) bring into evidence the following two reciprocity relations for 
scattering of a polychromatic plane wave on a quasi-homogeneous medium: 

(1) the spectrum of the scattered field in the far zone is proportional to the Fourier trans¬ 
form of the correlation coefficient of the scatterer: and 

(2) the spectral degree of coherence of the scattered field in the far zone is proportional to 
the Fourier transform of the strength of the scattering potential. 

These reciprocity relations for scattering are analogues to the reciprocity relations for radi¬ 
ation from quasi-homogeneous sources, which we encountered in Section 5.3.2 [Eqs. (16) 
and (21) of that section]. 
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S (s \rs, co)/S^(rs 0 , co) 



ptW(rSi, rs 2 , <o) 



Fig. 6.4 Illustrating the effect of scattering of a plane polychromatic wave on a quasi- 
homogeneous. Gaussian-correlated spherical medium. The normalized correlation coeffi¬ 
cient p F and the strength factor I F of the scattering potential of the medium are taken to be 
given by the expressions a>) = e ' and/ f (r, to) = e 1 /(I ' T / ) . ( a ) xhe 

normalized spectral density S (sl (rs, <o)/& s) {rs n , a>) of the scattered field in the far zone, as 
a function of the angle 6 between the direction of scattering s and the direction of inci¬ 
dence s 0 . (b) The spectral degree of coherence r s 2 , <u) of the scattered field in the 

far zone in the directions Sj and s 2 located symmetrically with respect to the direction s 0 
of incidence and • s 0 = s 2 • s 0 = cos <jx | Adapted from T. D. Visser, D. G. Fischer and 
E. Wolf, J. Opt. Soc. Amer. A23 (2006), 1631-1638.] 


Results of numerical calculations relating to scattering from a Gaussian-correlated 
medium, based on these two reciprocity relations, are shown in Fig. 6.4. 


PROBLEMS 

6.1 A plane monochromatic wave propagating in free space is scattered by a homogeneous 
sphere of radius a, whose refractive index is close to unity. Assuming that the effect of 
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the discontinuity of the scattering potential at the boundary of the sphere may be neg¬ 
lected. determine, within the accuracy of the first-order Born approximation, the scat¬ 
tering amplitude in a direction that makes an angle 0 with the direction of incidence. 

6.2 A polychromatic plane wave is incident on a weak random scatterer which occupies a 
finite volume. Derive an expression that indicates how the normalized spectrum of the 
scattered field in the far zone differs from the normalized spectrum of the incident field. 

6.3 A polychromatic plane wave is incident on a deterministic scatterer, which occupies a 
finite domain D. Derive an expression, valid within the accuracy of the first Born 
approximation, for the degree of coherence , y(r 1 s Il , r 2 s 2 , r), (sf = s) = 1), of the scat¬ 
tered field in the far zone. 

Consider also the special cases when (1) r x = r 2 \ and (2) s, = s 2 . Discuss the phys¬ 
ical significance of these two cases. 

6.4 Consider scattering of a plane, spatially coherent wave on a weak, homogeneous scat¬ 
terer, with correlation function C F (r,, r 2 , to) = C F ( r 2 - r b w) of the scattering potential. 

(a) Derive an expression for the cross-spectral density of the spectrum and for the 
spectral degree of coherence of the scattered field in the far zone. 

(b) Consider also the special case when the scatterer is delta-correlated, i.e. when 

C^r,, r 2 , to) = A(to)^ 3) (r 2 - r,), 

<5 (3) being a three-dimensional Dirac delta function and A(u>) a function which is 
independent of position. 

6.5 A coherent plane wave with spectral density 5' (0) (to) propagates into the half-space 
z > 0 and is scattered on a finite random medium, located in that half-space. The cor¬ 
relation function of the scattering potential Ffr, tu) is Qfr,, r 2 , w). 

Show that, within the accuracy of the first Born approximation, the spectrum 
S (s) (r, w) of the scattered field may be expressed in the form 

S (s) (r, cu) = 6’-°- , (w)Mr, w); 

and derive an explicit expression for the “spectral modifier” M( r, to) for this case. 
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7.1 Introduction 

In the preceding chapters we have been concerned with coherence phenomena which depend 
on correlations between field variables at two space-time points, say (r b tj) and (r 2 , ? 2 ). 
One then speaks of second-order coherence phenomena. When the fluctuations are statis¬ 
tically stationary and the polarization features of the field are not taken into account, such 
phenomena may be characterized by the “space-time” correlation function (the mutual 
coherence function) [Eq. (6), Section 3.1] 

Itru r 2 , r) = {V*(r,, f)V(r 2 , t + r)) (1) 

in the space-time domain; or, equivalently, by the space-frequency correlation function, 
i.e. the cross-spectral density [Eq. (13) of Section 4.1], 

W(r b r 2 , w) = (17 ( r,, co)U(r 2 . a> )). (2) 

The two correlation functions given in Eqs. (1) and (2) form a Fourier-transform pair [Eq. (2) 
of Section 4.1]. 

Although the coherence phenomena that have been discussed so far can be described in 
terms of such correlation functions, there are other coherence phenomena which must be 
analyzed by using other correlation functions. Because the theory involving them is some¬ 
what complicated, we will consider only correlation phenomena which can be described by 
the so-called fourth-order correlation functions, which are defined by the expression' 


r ( 2 - 2 ) (r|, q; r 2 , t 2 ; r 3 , f 3 ; r 4 , q) = (V (r,, q)V*(r 2 , q)V(r 3 , l 3 )V(r 4 , q)). (3) 

The first superscript on F (2 - 2) in Eq. (3) indicates that the correlation function contains two 
complex conjugates of the field variable V, namely V*(iq, q) and ^‘(ri, ? 2 ). The second 


For a general statistical description of stochastic fields in terms of correlation functions of all orders see M&W, 
Sections 8.2 and 8.3. 
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superscript indicates that it contains two (non-conjugated) field variables V(r 3 , r 3 ) and V(r 4 , r 4 ). 
In this notation 

r (U, (n> /j; r 2 , t 2 ) = (V*(n, n)' / (r 2 . tn)) (4) 

is the mutual coherence function defined by Eq. (1), when the field ensemble is not neces¬ 
sarily stationary. 

In genera], no simple relation exists between the correlation functions r (2-2) and F £1 ’ 1) . 
However, there is an important and broad class of random processes, known as Gaussian 
random processes , for which the correlation functions r (M Ar> , with M and N being non¬ 
negative integers, can be expressed in terms of the lowest-order one. Such a random process 
is characterized by the property that the joint probability densities 
/?„(Vj, V 2 ,. . ., v,- r,, t,; r 2 , f 2 ;. . r,„ t„) 

are all Gaussian distributions. 1 Random processes of this kind turn up very frequently in 
nature. A reason for this was mentioned, for example, in connection with Eq. (9) of Section 2.1. 

A basic property of a Gaussian random process in any number of variables (known as a 
multivariate Gaussian random process ) is that it is completely specified by its first-order 
and second-order moments (correlations). For example, for such a process, assuming for 
simplicity that the first moment (the mean) is zero, the following relationship exists: 


r <2 ’ 2, (r,, t v r 2 , t 2 : r 3 , r 3 ; 


r 4 . r 4 ) = <V'*(r 1 , r,, r 2 )V(r 3 , %)V(r 4 , t % )) 

= (V\r v h )Vlr 3 , t 3 )){V*(r 2 , t 2 )V( r 4 , r 4 )} 

+ {V*(r lt t 4 ))(V*(r 3 , t 2 )V(r y r 3 )). 


(5a) 


r a2, (r p tyl r,, f,; r 3 , t 


2 ’ 3 ’ 


f 4 ) = r ,u »(r r t x - r 3 , t 3 )P u >(r 2 . r 2 ; r 4 , t A ) 

+ r tuj (r,, t- r,, t,)T iU >(r,, tp, r 3 , t,). 


(5b) 


where r (U) is defined by Eq. (4). As we will see shortly, this is a very important relationship 
which is of particular relevance to intensity interferometry, which will be discussed in Section 
7.3. It is a special case of a general formula known as the moment theorem, which holds for 
any multivariate Gaussian random process. It expresses correlations of any order ( M, N) in 
terms of correlations of order (1, 1). 

As we mentioned in Section 2.1, thermal radiation is a realization of a Gaussian random 
process, which is a consequence of the fact that at room temperature such radiation is 
largely produced by the process of spontaneous emission. 


A comprehensive discussion of Gaussian distributions of an> number of real or complex variables is given 
in C.L. Mehta, “Coherence and Statistics of Radiation" in Lectures in Theoretical Physics, Vol. VIIC, 
W.E. Britten ed. (University of Colorado Press. Boulder, CO. 1965), pp. 34-5—401. See also C.L. Mehta, in 
Progress in Optics, E. Wolf ed. (North-Holland, Amsterdam. 1970), Vol. VIII, Appendix A. pp. 431—434, and 
K. S. Miller, Multidimensional Gaussian Distributions (J. Wiley, New York, 1964). 
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7.2 Intensity interferometry with radio waves 1 

In the early days of radio astronomy, around 1950, attempts were made to measure the 
angular diameters of radio stars. It seemed that the use of a Michelson-type stellar interfer¬ 
ometer (see Section 3.3.1) for such measurements would not be practical, because it was 
suspected (incorrectly, as it turned out) that one would have to use extremely long base¬ 
lines. At that time it was known only that angular diameters of radio stars could not be 
much greater than a few minutes of arc and some astronomers believed that they might be 
as small as those of visible stars. At meter wavelengths an interferometer with a baseline of 
hundreds or perhaps thousands of kilometers would then be required. Obviously, it would 
not be possible then to use a Michelson-type stellar interferometer, because the required 
stability could not be maintained over such long distances. 

In the early 1950s a British engineer, Robert Hanbury Brown, considered the possibility 
of using a different type of stellar radio interferometer. In the Michelson stellar interferom¬ 
eter, light arriving from a distant star at two outer mirrors M\ and M ? _ of the interferometer 
(see Fig. 3.10) propagates to the detection plane, where it forms an interference pattern. 
From measurements of the fringe visibility, the angular diameter of the star can be deduced 
by use of the van Cittert-Zernike theorem. In the Hanbury Brown interferometer the sig¬ 
nals arriving at the antennas A ( and A 2 (see Fig. 7.1) are compared with each other by the 
use of a correlator (multiplier). The first interferometer of this type w'as described in 1952. 2 
It was used to determine the angular diameters of two stars, using antennas separated by a 
few kilometers. Since that time, the correctness of the results has been confirmed by other 
interferometers. 

The principle of operation of the intensity interferometer with radio waves may be 
understood from the following considerations. Let 

I(r j ,t) = V*(r j ,t)V(r J ,t), (j = 1,2) (1) 

be the instantaneous intensities of the radio waves, assumed to be members of a statistically 
stationary ensemble, arriving at the two antennas Ai and /L. located in the neighborhoods 
of points r t and r 2 , respectively. For simplicity we will ignore the vector nature of the wave- 
field, representing it by a scalar wave function. If the mean values (7(r b t)) and (/(r 2 , 0) of 
the intensities of the waves arriving at the antennas are subtracted electronically, the inten¬ 
sity fluctuations are given by the expressions 

A/(r ; , t) = I(r r t ) - (I(r r t)) (j ~ 1,2). (2) 

The correlation of the intensity fluctuations at the two antennas is given by the expression 
<A/(r b ?)A7(r 2 , t + r)) = ([/( r b t) - (/( r b r))][/(r 2 , t + r) - (/(r 2 , r + r)>]>, (3) 


1 A comprehensive account of intensity interferometry, both with radio waves and with light, is given in 
R. Hanbury Brown, The Intensity Interferometer (Taylor and Francis. London, 1974). 

2 R. Hanbury Brown, R. C. Jennison and M. K. Das Gupta, Nature 170 (1952), 1061-1063. See also R. Hanbury 
Brown and R.Q. Twiss, Phil. Mag. 45 (1954), 663-682. 
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A, A 2 



Fig. 7.1 Block diagram of a radio intensity interferometer. [Adapted from R. Hanbury 
Brown, The Intensity Interferometer (Taylor and Francis, London. 1974), p. 84.] 

where r denotes the time delay between the waves arriving at the two antennas. On perform¬ 
ing the multiplication on the right-hand side of Eq. (3) and using the fact that the assumed 
stationarity of the field ensemble implies that {/(r 2 , t + r)) = {/(r 2 , t)), Eq. (3) gives 

(A/(r p t)AI( r 2 , t + r)} = {I(r v t)I( r 2 , t + r)) - </(r, , ?))</(r,, t )) 

- (/(r,, ?)){/(r 2 , 0) + </(r,, f))</(.r 2 , 0) 

= {/(r,, t)I(r 2 , t + t))~ (/(r p f))</(r,, r)). (4) 

Let us express the right-hand side of Eq. (4) in terms of the fluctuating field incident on the 
antennas. One then finds that 

{A/(r p f)A/(r 2 , t + r)) = {V*(r p OVt^, f)V(r 2 , r + t)V{ r 2 , r + r» 

- (V*(r p f)V(r,, f))(V*(r 2 , ?)V(r 2 , 0). (5) 

We note that the first term on the right is the fourth-order correlation function which we 
encountered earlier [Eq. (3) of Section 7.1)]: 

{V" (r b f)V(ri, f)V*(r 2 , t + t)V( r 2 , t + r)) = r <2 - 2) (r b t , r 2 , t + r; r,, r, r 2 , / + r). (6) 
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It is reasonable to assume that the wavefield reaching the antennas arises from the super¬ 
position of many independent contributions of the stellar radio source; and consequently, 
according to the central limit theorem, mentioned in connection with Eq. (9) of Section 2.1, 
the field arriving at the antennas will be a realization of a Gaussian random process. Under 
these circumstances, and assuming the wavefield to be statistically stationary', the fourth-order 
correlation function (6) may be expressed, according to Eq. (5a) of Section 7.1, in the form 

<V*(r P t)V( r p t)V*( r 2 , t + r)V(r 2 , t + r)> 

= (V*(r p t)V( r p r))<V*( r 2 , t + r)V(r 2 , t + r)} 

+ <V*(r p t)V( r 2 . r + r)){V*(r 2 , t + r)^,, t)) 

= (I(r v /)}</(r 2 , 0) + r^Ki-p r 2 , T)r tu, (r 2 . r p -r) (7) 

or, since I' 1 " 11 obeys the relation 

r (l,1, (r 2 , r b —r) = r (1,1, *(r lt r 2 , r), (8) 

one has 


{y*(r p t)V(X v t)V*( r 2 , t + r)V(r 2 , t + r)} 

= (/(r p 0){/(r 2 , 0) + |rM(r, r r 2 , rf. (9) 

On substituting from Eq. (9) into Eq. (5) we obtain for the correlation of the intensity fluc¬ 
tuations the formula 


<A/(r,, t)A/(r 2 . t + r)} = |r na »(r 1 , r 2 , r)| 2 . (10) 

Now the second-order correlation function r ,Ul (r,. r 2 , r) which appears on the right-hand 
side of Eq. (10) is just the mutual coherence function which we denoted before by r(r b r 2 , r); 
its normalized version 


rpj, r 2 , t) _ = r(r,, r 2 , r) 

V r (r,, r ,, 0)^T(r 2 , r 2 , 0) t))(I( r 2 , /)} 

= 7 (r,, r 2 , t) (11) 

is just the degree of coherence of the wavefield at the points r, and r 2 . Hence Eq. (10) may 
be expressed in the form 

{AI(r v t)Al( r 2 , r + r)} = </(r p t)){I( r,, t))\^r v r 2 , r)| 2 , (12) 


which implies that 


(A/(r p t)AI(r z , t + r)) 
(/(r p f ))</(r 2 , r)) 


= It(* 


(13) 
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Equation (13) is the basic formula of intensity interferometry for thermal radiation. It 
shows that the absolute value of the degree of coherence of a thermal field at two field 
points r i and r 2 may be determined from measurements of the correlations of intensity fluc¬ 
tuations and the average intensities at these points. 

We recall that by use of the Michelson stellar interferometer one can, in principle, deter¬ 
mine the complex degree of coherence 'tfr,. r 2 , r) at the two mirrors from visibility meas¬ 
urements although, in practice, atmospheric turbulence essentially destroys information 
about its phase. In intensity interferometry, as we just showed, one can determine the mod¬ 
ulus | 7 (ri, r 2 , t)| of the degree of coherence. By use of the van Cittert-Zemike theorem one 
can then obtain information about the intensity distribution across the source, assuming 
that it is spatially incoherent. However, as is evident from Eq. (13), such measurements 
cannot provide, even in principle, any information about the phase of the degree of coher¬ 
ence. This limitation is. however, of no consequence, if the stellar disk is rotationally sym¬ 
metric, as we briefly discussed in Section 3.3.1. 

7.3 The Hanbury Brown-Twiss effect and intensity 
interferometry with light 

After the first successful determination of the angular diameters of radio stars by intensity 
interferometry, Hanbury Brown and Twiss turned their attention to the possibility of using a 
similar technique to measure the angular diameters of stars at optical wavelengths. It was 
clear that, if this were possible, an appreciable amount of valuable new information could be 
obtained. Up to that time the only way to measure stellar diameters at optical wavelengths 
was by use of the Michelson stellar interferometer. Practical difficulties had restricted the 
use of that method to measurements made by Michelson and Pease in the 1920s determin¬ 
ing the angular diameters of only six stars. A source of the difficulties in such measurements 
was the extremely small angles subtended by stars at the Earth’s surface - of the order of 
10 -4 seconds of arc; this requires the use of interferometers with baselines of several hun¬ 
dred meters. Further, atmospheric turbulence produces blurring and distortions of the image 
of the star. It was chiefly for these reasons that no progress made in applying Michelson’s 
method to determining the angular diameters of other stars than the small number initially 
measured. 

At first glance it seemed that intensity interferometry would overcome some of the lim¬ 
itations of Michelson’s method and this indeed turned out to be the case. However, the theory 
of intensity interferometry with light rather than with radio waves turned out to be rather 
subtle, mainly due to the quantum-mechanical nature of detection of light fluctuations. To 
begin with we will ignore this complication and we w ill assume that Eq. (13) of Section 7.2, 
which we derived for intensity interferometry with radio waves, applies even when a very 
different detection process is employed, using the photoelectric effect. In Section 7.5 we 
will explain why Eq. (10) of Section 7.2. which we derived on the implicit assumption that 
the detector measures correlations in the intensity fluctuations of the classical field, can be 
used even when photoelectric detectors are used, as long as some subtleties connected with 
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the signal-to-noise ratio are ignored. Such subtleties must, of course, be taken into account 
in practice, but omitting this aspect of the technique does not prevent an understanding of 
the basic physical principles underlying it. 

In the spectral region of radio waves, the procedure of determining the correlation in the 
intensity fluctuations is straightforward, because square-law detectors measuring intensity 
fluctuations A/(r, t) of radio waves are well-known electronic devices. One need only set 
up two radio antennas in the neighborhood of the points r, and r . The output signals are 
then multiplied in an electronic correlator and the product is averaged and recorded. 

The situation is quite different with light. Light detectors with a sufficiently rapid response 
are photoelectric detectors. Such detectors are, however, highly “non-classical” devices, 
making use of the photoelectric effect. This is the phenomenon of ejection of electrons 
from a metal when electromagnetic radiation of short enough wavelength impinges on its 
surface It has been known for a long time that the energy of the ejected electron is inde¬ 
pendent of the intensity of the light that illuminates the metal surface but depends on the 
frequency of the light. When the intensity is increased, the number of electrons ejected 
increases, but not their energy. These observations are in contradiction with the classical 
wave theory of light. Einstein, in a famous paper published in 1905, explained these obser¬ 
vations on the basis of the corpuscular nature of light, i.e. by postulating that light incident 
on the photoelectric surface consists of particles - light quanta - now called photons. 
Einstein’s paper was the first paper which clearly indicated the need for quantizing the elec¬ 
tromagnetic field under some circumstances. In 1921 he was awarded the Nobel Prize for 
Physics “for his services to Theoretical Physics and especially for his discovery of the law 
of the photoelectric effect.” 

We derived the basic formula (13) of Section 7.2 using classical wave theory for the 
detection of radio waves. However, because of the non-classical nature of the photoelectric 
effect, it is by no means clear that the formula would also hold for light waves detected by 
use of this effect. In fact there was a bitter controversy, which we will mention later, regard¬ 
ing this question; it was eventually resolved in favor of the validity of the classical formula 
even under these circumstances. The justification involved some beautiful physics relating 
to the wave-particle duality of light which we will discuss in Section 7.4; it also required 
the development of the theory of photoelectric detection of light fluctuations, which was 
formulated largely in order to clarify some of the questions surrounding this problem. 

In 1956. shortly after the successful determination of diameters of radio stars by inten¬ 
sity interferometry. Hanbury Brown and Twiss performed laboratory experiments to deter 
mine whether the technique worked also with light. 1 Light from a mercury arc was filtered 
and split into two beams by a half-silvered mirror and the two beams were then incident on 
two photo-multiplier tubes, one of which was movable across the beam (see Fig. 7.2). As 
the photo-multiplier moved, the degree of coherence of the light at the two photo-cathodes 
varied in a manner that could be calculated from the geometry of the arrangement. The elec¬ 
trical signals from the two photo-multipliers were fed into electronic circuits, where they 


R. Hanbury Brown and R. Q. Twiss, Nature 111 (1956), 27-29. 
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Fig, 7.2 Apparatus used for the demonstration of the Hanbury Brown-Twiss effect. 
[Adapted from R, Hanbury Brown and R.Q. Twiss, Nature 177 (19561, 27-29.] 


were amplified. The outputs were multiplied together in a multiplier. With this equipment the 
intensity correlation was measured for various positions of the movable photo-multiplier 
and compared with the theoretical values calculated from the geometry of the system, the 
spectral distribution of the light and the electrical characteristics of the circuits. The results 
are shown in Fig. 7.3 and are seen to agree well with predictions of the classical theory. 1 

In spite of the successful laboratory experiment a controversy surrounding the effect 
continued and papers were published reporting results of similar experiments but with neg¬ 
ative results, suggesting that the effect does not exist. It was shown later that the negative 
results were largely due to substantial underestimation of the time needed to obtain a sig¬ 
nificant signal-to-noise ratio. 2 

Before discussing the resolution of the controversy surrounding the Hanbury Brown-Twiss 
effect and outlining why the “classical theory" applies to understanding the principles of 
the optical intensity interferometer, we will briefly describe the first two such interferometers, 
which were built in the period 1956-1966. 


1 A similar experiment was performed by W. Martienssen and E, Spiller. Am. J. Phys. 32 (1964), 919-926 by using 
so-called pseudo-thermal sources. Such sources were produced from laser sources by changing the statistical 
properties of the light by passing it through a rotating diffuser. In this way light was generated whose statistical 
properties were governed by Gaussian distributions; but, unlike the usual thermal sources, it had a very high sta¬ 
tistical degeneracy (see Appendix I), which made it appreciably easier to demonstrate the effect. 

1 It was estimated that in one of the experiments [A. Adam. L. Janossy and P. Varga, Arm. Phys. (Leipzig) 16 
(1955), 408—413] 10' 1 years (longer than the estimated age of the Earth) would have been needed to obtain a 
significant signal-to-noise ratio. In experiments described by E. Brannen and H. I. S. Ferguson. Nature 178 
(1956), 481—482. the estimated time required for obtaining a significant signal-to-noise ratio was 1,000 years. 
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Fig. 7.3 The experimental and theoretical values of the squared modulus of the degree 
of coherence |7n(0)| 2 for various separations between the photo-cathodes. [Adapted from 
R. Hanbury' Brown and R.Q. Twiss, Proc. Roy. Soc. (London) A243 (1958), 291-319.] 


A block diagram of the detector part of an optical stellar-intensity interferometer is 
shown in Fig. 7.4. The first interferometer of this kind 1 utilized devices that were somewhat 
primitive, namely two parabolic reflectors of searchlights used by the British Army. Their 
chief purpose was to concentrate light from a star onto two photodetectors, without aiming 
at producing a good image. For, as we have already noted, the phase of the incident light 
plays no role in the determination of the angular diameter of the star by this technique, 
because the correlation of the intensity fluctuations at the two detectors depend only on the 
modulus of the degree of coherence of the incident light, not on its phase [see Eq. (13) of 
Section 7.2]. For the same reason the atmospheric fluctuations are of less consequence in 
such measurements than in measurements that use the Michelson stellar interferometer. 

Using this interferometer Flanbury Brown and Twiss found that the angular diameter of 
the star Syrius is 6.9 X 10~ 3 seconds of arc, using baselines of up to about 9 m (see Fig. 7.5). 

Having convincingly demonstrated the possibility of determining the angular diameters 
of visual stars by intensity interferometry, Hanbury Brown and Twiss and their collabora¬ 
tors built a large interferometer of this kind at Narrabri in Australia, about 370 miles by 
road north of Sydney. The general layout of the interferometer is shown in Fig. 7.6 and can 
best be described by quoting the following passage from Hanbury Brown’s book (p. 94) on 
this subject cited earlier: 

The photoelectric detectors were each mounted at the focus of two very 7 large 
reflectors carried on trucks running on a circular railway track with a gauge of 
5.5 m and a diameter of 188 m. These mobile trucks were connected to the control 
building by cables suspended from steel catenary wires which were attached at one 


R. Hanbury Brown and R.Q. Twiss, Mature 178 (1956), 1046-1048: Proc. Row Soc. (London), A248 (1958). 
222-237. 
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Fig. 7.4 Block diagram of an optical stellar intensity interferometer. [Adapted from 
R. Hanbury Brown, The Intensity Interferometer (Taylor and Francis, London, 1974), p. 48.] 





Baseline (m) 

Fig. 7.5 The squared modulus of the degree of coherence of light from the star Sirius 
A. The points indicate measured values, together with probable errors. The curve shows 
the theoretical values for a star of angular diameter 0.0069 seconds of arc. [Adapted 
from R. Hanbury Brown and R. Q. Twiss, Proc. Roy. Soc. (London) A248 (1958), 
222-237.] 


end to a bearing at the top of a tower in the center of the circle, and at the other to 
a small tender towed by each truck. When not in use the reflectors were housed in 
a garage built over the southern sector. A valuable but expensive feature of this 
garage was a slot running almost the full length of one wall enabling the trucks to 
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Reflector truck 



Fig. 7.6 The general layout of the stellar intensity interferometer at Narrabri. [Adapted 
from R. Hanbury Brown. Sky & Telescope 28 (1964), 64—69.] 


be parked inside without detaching the cables and hence without disturbing the 
electric connections. 

The control building which housed the control desk, the computer, a large air- 
conditioning plant and various motor generators, switchboards, etc. was a two- 
story brick building of solid construction with a good heat-reflecting roof so that it 
was possible even in the summer at Narrabri to hold the inside temperature to 
72 ± 2°F. It was close enough to the central tower to allow the catenary cables to 
pass over its roof [see Fig. 7.6]. 

The reflectors [shown in Fig. 7.7] were regular 12-sided polygons roughly 6.5 m in diame¬ 
ter. each having a useable reflecting area of 30 nr. They were mounted on turntables car¬ 
ried by trucks. 

The focal length of each mirror was about 11 m and the surface of each reflector was a 
mosaic of 252 hexagonal mirrors (Fig. 7.7), each approximately 38 cm between opposite 
sides and 2 cm thick. The angular diameters of 32 stars were determined by this instrument 
and this information represents a major contribution to astronomy. 

Although the Hanbury Brown-Twiss effect was discovered, and has been used, in con¬ 
nection with determining stellar diameters, it has found other applications since then, for 
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Fig. 7.7 Twin light collectors of the stellar-intensity interferometer on the circular track at 
Narrabri Observatory, New South Wales, Australia. Some of the small mirrors have been 
removed from each of the 6.5-m reflectors for recoating. A beam 11 m long supports a 
photoelectric detector at the focus of each reflector. [Adapted from R. Hanbury Brown, 

Sky & Telescope 28 (1964), 64-69.] 

example in high-energy physics, nuclear physics, atomic physics and condensed-matter 
physics. 1 

7.4 Einstein’s formula for energy fluctuations in blackbody radiation and 

the wave-particle duality 

Let us now return to the question of why the performance of an optical intensity interfer¬ 
ometer can be described by classical wave theory, even though the photoelectric detectors 
act in a highly non-classical manner. A turning point in the controversy surrounding this 
subject was the publication of a short note by E. M. Purcell. 2 He showed by a simple heuristic 


1 See, for example. G. Baym, Acta Phys. Polon B29 (199S), 1839-1884; M. Schellekens. R. Hoppeler, A. Perrin, 
J. Viana Gomez, D. Boiron, A. Aspect and C.I. Westbrook, Science 310 (2005), 648-651; A. OttI, S. Ritter, 
M. Kohl and T. Esslinger, Phys. Rev. Lett. 95 (2005), 090404. 

2 E.M. Purcell, Nature 178 (1956), 1449-1450. 
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argument that, when a beam of light is incident on a photodetector, the statistical fluctua¬ 
tions in the counting rate of the emitted electrons will be greater than one would expect in 
a random sequence of independent events occurring at the same average rate. The addi¬ 
tional fluctuations give rise to the cross-correlation in the intensity fluctuations found by 
Hanbury Brown and Twiss, which is essentially a classical wave contribution. 

The presence of both a particle contribution and a wave contribution in the photoelectric 
detection of light fluctuations brought out by Purcell’s analysis may be regarded as a man¬ 
ifestation of the wave-particle duality, which is a basic feature of quantum mechanics. It 
was discovered by Einstein 1 well before the formulation of quantum mechanics, in a beau¬ 
tiful paper dealing with energy fluctuations in blackbody radiation. Because Einstein’s 
paper has a close bearing on Purcell’s argument regarding the origin of the Hanbury 
Brown-Twiss effect, we will briefly present its essence here. 

Einstein considered blackbody radiation in equilibrium in a cavity of large volume V. In 
any subregion of volume v <S V the energy will fluctuate, moving in and out of the subre¬ 
gion. He inquired about the magnitude of the fluctuations in this region in a small fre¬ 
quency range (v, v + dv), choosing as the measure of the fluctuations the variance of the 
energy E in that region, i.e. the quantity 

(AE) 2 = (£ - £) 2 , (1) 

where, for convenience, we now use an overbar rather than angular brackets for an ensem¬ 
ble average. Einstein showed first from thermodynamic considerations that 2 

(AE^ = V' 2 § 22 , (2) 


where k B is the Boltzmann constant and T is the absolute temperature. Equation (2) is often 
called the Einstein-Fowler formula. 

Now, for blackbody radiation, E is given by Planck’s law 


E 


= Z 


hv 

e hv/(k B T ) _ ] 


(3) 


where h is Planck’s constant and 


„ %nVv 2 du 

z ' — l4 > 

c being the speed of light in vacuum. The quantity Z may be shown to represent the so- 
called number of cells in phase space associated with the volume Vin ordinary space and 


1 A. Einstein. Phys. Z. 10 (1909), 185-193; English translation in The Collected Papers of Albert Einstein, Vol. 2 
(Princeton University Press, Princeton. NJ, 1989), 357 -375. 

2 For a derivation of this formula see, for example, F. Reiche, The Quantum Theon> (Methuen. London. 1922), 
Note 52, 139-140. 
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the frequency range (u, v + dy) [see Appendix 1(b)], On substituting for the mean energy 
E from Eq. (3) into Eq. (2), one obtains for the variance of the energy fluctuations in the 
subregion the expression 


(A E) 2 =hvE + ^E 2 . (5) 

This formula is known as Einstein’s fluctuation formula for blackbody radiation. 

Einstein drew some remarkable conclusions from this formula. He pointed out that if the 
cavity V contains only waves, as most physicists at that time believed, one could decom¬ 
pose the field within the cavity into a set of plane-wave modes of different amplitudes and 
phases, propagating in different directions. Short-term interference of the waves would 
then produce field fluctuations and, consequently, energy fluctuations in the subregion v of 
the cavity. Einstein argued, by using a simple dimensional argument, that the variance of 
these fluctuations will be given exactly by the second term in Eq. (5), namely by the formula 1 

(A£) 2 = — E 2 . (6) 

Einstein next argued as follows: since only the second term on the right-hand side of Eq. (5) 
can be understood on the basis of wave theory, that theory does not fully explain the origin 
of the fluctuations. Suppose that the volume contains n particles (quanta), each of energy 
hv. If they were independent classical particles, the fluctuations in their number would 
obey the Poisson distribution [see Appendix IV(a)], viz.. 


pin) 


»"e " 
ft! 


(7) 


Einstein’s heuristic argument by means of which he interpreted the meaning of the second term on the right of 
his fluctuation formula (5) occupies only eight lines of his paper. Another plausibility argument, indicating that 
the variance of fluctuations of the waves is proportional to E 2 , may readily be given on the basis of Eq. (12) of 
Section 7.2. That formula is an expression for the correlation of intensity fluctuations in a field governed by a 
multivariate Gaussian distribution (i.e. a Gaussian random process), as might be expected to be the case for 
blackbody radiation. If we set in that formula rj = r, = r and r = 0, then flr, r, 0) = 1 and the equation 
reduces to (using now an overbar instead of angular brackets to denote an ensemble average) 


(A I) 2 = I 1 . 

where / is the instantaneous intensity. Evidently I may be taken to be a measure of the instantaneous energy E. 
Writing E = a/, where a is a constant, the above formula reduces to 


(A E) 2 ~ E 2 . 

A long but rigorous derivation of the formula (6) was given by H. A. Lorentz in his book Les theories statis- 
tiques en thermodynamique (Teubner, Leipzig and Berlin, 1916), 114-120. 
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The total energy of n such particles would be E = nhv, so that 

E 

n = —. 
hv 

Now the variance of a Poisson distribution is 

(An) 2 = n. 

and it follows from Eqs. (8) and (9) that the variance [A (El(hv))} 1 = E/(hv), i.e. 


( 8 ) 

( 9 ) 


(panicle, = hvE ■ 


( 10 ) 


Here we have attached the subscript “particles” to stress that the expression was derived on 
the assumption that the energy fluctuations are caused by particles. We see that the expres¬ 
sion on the right of Eq. (10) is precisely the first term on the right-hand side of Eq. (5). 
Hence, using also Eq. (6), we can express Eq. (5) as 

(A E) 2 = (A EY- + (A E) 2 . (1D 

particles waves V 11 ) 


This result was the first example of the wave-particle duality, which about 15 years later 
became a prominent feature of quantum mechanics. 

Equation (11) is a kind of a hybrid result that expresses the variance of energy fluctua¬ 
tions in a cavity containing blackbody radiation as a sum of contributions of classical par¬ 
ticles and classical waves. Today one would interpret Einstein’s result somewhat differently, 
in terms of either non-classical waves or non-classical particles. For example, by using Eq. 
(8) we could rewrite Eq. (5) in the form 

(An) 2 = it + (12) 

The presence of the second term on the right of this formula shows that the particles do not 
behave as classical particles which obey a Poisson distribution because, for such particles, 
Eq. (9) rather than (12) would apply. Equation (12) for the variance is, in fact, a well-known 
formula pertaining to quantum particles known as bosons in Z cells of phase space [see 
Appendix 1(b)], 

With this background we will now return to the Hanbury Brown-Twiss experiment and 
consider, more generally, the theory of photoelectric detection of light fluctuations. 


7.5 Mandel’s theory of photoelectric detection of light fluctuations 

7.5.1 Mandel’s formula for photocount statistics 

As noted in the previous section, the controversy surrounding the theory of photoelectric 
detection of light fluctuations was largely resolved by a brief discussion by E. M. Purcell. 
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He showed that the reason why correlations in the fluctuations of a classical optical field 
can be extracted from photocount measurements was due to the excess fluctuations (arising 
from the boson nature of light) over completely random counts. 

L. Mandel in two important papers 1 carried Purcell’s argument further and developed a 
theory of photoelectric detection of light fluctuations, which he appropriately referred to as 
stochastic association of photons with random waves. Mandel’s analysis not only gave a 
clear explanation of the Hanbury Brown-Twiss effect, but also provided a general theory of 
the interaction of stochastic light beams with photoelectric detectors. We will now give a 
brief account of it. 

Consider first a single photodetector exposed to an incident light wave, assumed for sim¬ 
plicity to be linearly polarized, with fluctuating complex amplitude V(f), represented by an 
analytic signal (Section 2.3). It seems plausible and can be shown, for example, by semi- 
classical theory, 2 that the probability that an electron will be emitted within a time interval 
(f, t + At) is proportional to the instantaneous light intensity I(t) = V*(t)V(t), i.e. 

P(t)At = oi/(t)Af. (1) 

a being a constant representing the quantum efficiency of the photodetector. Starting from 
Eq. (1) Mandel showed that the probability that n electrons are emitted within a time inter¬ 
val (t. t + T) is given by the formula 

pin, t, T) = —[aWit, r)]' i e~ omr ’, (2) 

71 ! 


where 


p t-\-T 

Wit, T) = J /(t')dr' (3) 

is the integrated intensity of the light incident on the detector over the time interval (t. 
t + T). The derivation of Eq. (2) is somewhat lengthy. It is given in Appendix III. 

It is of importance to appreciate that Eq. (2) pertains to a single realization of an incident, 
generally random, field. Physically more significant is the average, which we will denote 
by an overbar, of the expression given in Eq. (2) over the ensemble of the incident field: 

Pin , t. T) = p(n , f, T) 

= —[aW(t, T)]»e- aW(, - T) (4) 

n ! 


or, more explicitly, 

P(n, t, T) = f [n ^’ 7y 1" e-»wy ff) dff, (5) 

Jo n ! 


1 L. Mandel, Proc. Phys. Soe. (London) 72 (1958), 1037-1048; and Ibid. 74 (1959), 223-243. 

2 L. Mandel, E. C. G. Sudarshan and E. Wolf. Proc. Phys. Soc. (London) 84 (1964). 435—144, Eq. (2.18b). 
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where p(W) = p( W , t, T) is the probability density of the integrated intensity, given by Eq. 
(3), the variables t and T being kept fixed in the integration. We will refer to Eq. (5) as 
Mandel's formula for photocount statistics. 

The formula is of considerable generality. It applies to any kind of linearly polarized 
incident light, irrespective of its statistical properties and irrespective of whether the light 
fluctuations are statistically stationary or not. It should be noted that the formula involves 
probabilities in two different ways. First, it contains the probability relating to emission of 
photoelectrons from a single realization of the incident field; and second, it involves the 
ensemble of the incident field. 

The integrand on the right-hand side of Mandel's formula contains the expression 
T)] n c~ aW<J - 12 in\, which is a Poisson distribution with mean 

n = aW(t,T). (6) 

However, Mandel’s formula (5) is, in general, not a Poisson distribution, because of the 
presence of the weighting factor p(W) under the integral sign. The formula may be said to 
represent a Poisson transfonn of the probability p(W). 


7.5.2 The variance of counts from a single photodetector 

The variance of the counts, recorded in the time interval ( t , t + T) is given by the formula 

(An) 2 = (n — n) 2 = n 2 — n 2 , 

(7) 

where the first two moments, 


X 

n = ^2 np{n , f , T) 

n= 0 

(8a) 

and 


_ X 

n 2 = ^2 n 2 P( n ’ 

«=0 

(8b) 

of the averaged Poisson distribution p(n. t. T), defined by Eq. (4), 
known formulas 1 

are given by the well- 

h = aW(t, T), 

(9a) 

n 2 = aW(t, T) + a 2 [W(t,T)] 2 . 

(9b) 


1 See, for example, A. Papoulis, ProbabiUtv, Random Variables and Stochastic Processes (McGraw-Hill, New 
York. 1965), 145. 
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Hence the variance defined by Eq. (7) is given by the expression 


(An) 2 = aW(t,T ) 4- cr[W(t,T)] 2 


Wit, T) 


( 10 ) 


or, using Eq. (9a), 


(An) 2 = n + cr[AW(t,T)\ 2 , 


( 11 ) 


where 


[A W(t,T)] 2 = 


W(t.T) - Wit, T) 


\W(t.T )] 


2 _ 


W(t. T) 


( 12 ) 


is the variance of the integrated intensity Wit, T) defined by Eq. (3). 

Equation (11) has a simple interpretation. It shows that the variance of the fluctuations in 
the number of ejected photoelectrons may be regarded as having two separate contributions: 
(i) from the fluctuations in the number of particles obeying the classical Poisson distribution 
(the term it) and (ii) from fluctuations of a classical wave (the wave interference term 
a 2 [AW(t, 7)] 2 ). This result which, as we have just shown, holds for any radiation field, is 
strictly analogous to the celebrated result of Einstein, relating to energy fluctuations in a 
region of a cavity containing blackbody radiation, under conditions of thermal equilibrium, 
as discussed in Section 7.4. We now see that a fluctuation formula of the kind Einstein 
derived, which clearly exhibits the wave-particle duality of radiation, also holds for count¬ 
ing fluctuations in time intervals for any light beam (i.e. thermal or non-thermal, stationary 
or non-stationary), at points that may be situated far away from the source of the light. 

It is of interest to note that w'hen the intensity of the incident beam is stabilized, e.g. 
as in a single-mode laser beam, the variance [AI¥(t. 7j] 2 will be negligible because there 
wi ll be n o intensity fluctuations in the incoming beam and the formula (11) then reduces 
to (An) 2 = »], as for a system of classical particles. 

If, on the other hand, the light is of thermal origin, the probability density pin, t, T) of the 
photocount will be quite different. For thermal light the probability distribution of the incident 
field will be Gaussian (see Eq. (9) of Section 2.1). By using elementary rules of the theory 
of probability, this may be shown to imply that the probability distribution of the intensity 
is exponential (see M&W. Section 3.1.4), i.e. that 

p(I) = yexp(-//7). (13) 

One may then show by the use of Mandel's formula (5) and formulas (9a) and (3) that if the 
integration time T is much shorter than the coherence time of the light, i.e. if T < 2tt/Aoj, 
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where Aw is the bandwidth of the incident light, the probability distribution of the photo¬ 
electron becomes 1 


Pin, t. T) = pin, t. T) = ---—, (14) 

{n + l)" +l 

which is the Bose-Einstein distribution for one cell of phase space (see Appendix IV, 
Eq. (16), or Morse’s discussion"). 


7.5.3 Correlation between count fluctuations from two detectors 

We will now consider the situation when a light beam illuminates two photodetectors, a 
situation which, as we saw, is of special interest in optical intensity interferometry 7 . Suppose 
that the detectors are located in the neighborhoods of points r ( and r 2 , respectively. We will 
indicate by subscripts 1 and 2 the outputs of each of the two detectors. 

Let us determine the correlation A« 2 of the fluctuations 

A/jj = /jj - Hj and A n, = n n — n i, (15) 

of the photocounts of the two detectors. We have 

X X _ 

rpfT 2 = Y* n i r hPMr r r T )P 2 i n 2 - L* T ) 

it=0 nf=0 

-(16) 

x x 

= Y1 n \P\i ,l \' f i• n 2 P 2 ( n 2’ h' T) ' 

»,=0 n,=0 

where the probabilities pfn r tj, T). j — 1.2. are given by expressions such as Eq. (2). Now, 
according to Eqs. (8a) and (9a), 


n \ = Y1 n \Pi( n \> r v T '> = Q i w i^p r ). (17a) 

n ] =0 


n 2 = } hP 2 ^ n 2 “ l 2 ’ = <x 2 W 2 (t 2 , T\ (17b) 

n 2 =0 

i.e. the expectation values of and n 2 are proportional to the averaged integrated intensity 
Wi t, T) at each of the two points. Hence, using Eqs. (17), we have 

n x ti 2 = T)W 2 (t 2 , T). (18) 


L. Mandel in Progress in Optics , E. Wolfed. (North-Holland. Amsterdam, 1963), Vol. 2. p. 228. Eq. (71b). 
P. M. Morse, Thermal Physics (Benjamin. New York. 1962). p. 218. 
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Now 


An, A n 2 = (n, - nfi{n 2 - n,) 


(19) 


or, using Eqs. (16) and (18), 



which implies that 


Am, Aw, = OjGjAWJO'j, T)AW 2 (t v T\ 


( 20 ) 


where 


AW.(? r T) = W.(t r T) - W.(t r T) 


( 21 ) 


are the fluctuations of the integrated intensities of the light incident on the two detectors. 

The formula (20) shows that the correlation An, Art 2 between the fluctuations of the photo¬ 
count at the two detectors is proportional to the correlation between the fluctuations in the 
integrated intensities of the classical field incident on the two detectors. 

If the integration time T of the detector is short relative to the coherence time t c = 2tt/A<w 
of the incident light, we have from Eq. (3) that 


( 22 ) 


W(t,T) <* TKt) 


and Eq. (20) reduces to 


An, Art, = a,a,r 2 A/, (r, )A/, (/,), 


(23) 


where A fit) = fit) — fit ) denotes the fluctuation of the intensity. Except for a slight change 
in notation, the right-hand side of Eq. (23) is identical with the left-hand side of Eq. (10) of 
Section 7.2 which, as we have seen, is the basic quantity from which angular diameters of 
stars are determined by intensity interferometry. 

Equation (23) confirms that the correlation of intensity fluctuations of the classical field 
incident on the two detectors can indeed be determined from measurements of correlations 
of the fluctuations in the number of electrons emitted by the two illuminated photodetec¬ 
tors, in spite of the non-classical nature of the photoelectric effect. 

Equation (20) contains the essence of the Hanbury Brown-Twiss effect. In practice it is, 
of course, also necessary to have an estimate of the signal-to-noise ratio. For discussion of 
that topic we refer the reader elsewhere. 1 It turns out that the signal-to-noise ratio depends 


1 See R. Hanbury Brown and R. Q. Twiss, Proc. Roy. Soc. (London), A242 (1957), 300-324, Section 3(d). See also 
L. Mandel, in Progress in Optics, E. Wolf ed. (North-Holland, Amsterdam, 1963), Vol. II, Section 4.3, pp. 181-248. 
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rather significantly on the degeneracy parameter 8 of the light [see Appendix I (c)] and that 
when 8 is much smaller than unity it is essentially proportional to 8. We have shown in 
Appendix I that 8 <S 1 for thermal sources at laboratory temperatures and for light from 
stellar sources. It is mainly for this reason that the effect had originally been so difficult to 
demonstrate. Since for laser light the degeneracy parameter is generally very large, of the 
order of 10 1 or greater, one might expect that the effect would be very easy to demonstrate 
with laser light. However, as we have already noted, this is not so. A good-quality laser will 
generate a beam in which the intensity fluctuations A/ are negligible. Consequently corre¬ 
lations in intensity fluctuations are essentially absent and the large value of the degeneracy 
parameter is then irrelevant. The effect can, how'ever, easily be observed with so-called 
pseudo-thermal sources, which we mentioned in Section 7.3. 

Finally we recall that in the preceding discussion we represented the incident field by a 
single scalar wave function. This is equivalent to assuming that the incident field is linearly 
polarized. For a discussion of intensity fluctuations and the photoelectric detection of light 
of some other states of polarization we refer the reader to a paper by Mandel 1 

7.6 Determination of statistical properties of light from 
photocount measurements 2 

Let us now return to Mandel’s formula (5) of Section 7.5.1 for the probability that n elec¬ 
trons are emitted in the time interval (/, t + T) under the influence of incident light: 

P(n,t,T)= f [0 ' VV(A T)i " e~""' (r > 7j p(W)dW. (]) 

Jo n ! 

Here 

n t+T 

Wit. T) = J' I{t'W (2) 

represents the intensity of the incident light, integrated over the time interval it.t + T ), and 
a is the photo-efficiency of the detector. 

In principle, the photocount distribution Pin. t. T) may be measured. We will now' con¬ 
sider how' one can derive the probability distribution p( W) of the integrated intensity of the 
light incident on the detector from knowledge of that distribution. This amounts to finding 
the inverse of the Poisson transform which appears on the right-hand side of Eq. (1). For 
this purpose let us first introduce the function 

Fix) = f e" w piW)e~ nW dW. (3) 

J 0 


1 L Mandel. Proc. Phys. Soc. (London), 81 (1963), 1104-1114. 

2 The analysis presented in this section is largely based on a paper by E, Wolf and C. L. Mehta, P/ivr. Rev. Lett. 
13(1964). 705-707. 
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Taking the Fourier inverse of F{x) we have 


p(W) = - / F(x)e- uw d.v. 


e aW P 


(4) 


Next let us expand the exponential on the right-hand side of Eq. (3) in a power series. This 
gives, formally at any rate. 




(5) 


Now Mandel’s formula (1) implies that 



( 6 ) 


and hence Eq. (5) gives 



(7) 


It follows that the required probability distribution p(W) may be obtained from knowledge 
of the photocount distribution Pin, t, T) for n = 0, 1,2,... by first evaluating the function 
Fix), defined by Eq. (7), and then using Eq. (4). 

Let us illustrate the inversion procedure by a simple example. Suppose that the photo¬ 
count distribution P(n, t, T) is Poissonian, i.e. that 


Pin, t, T) 


( 8 ) 


Equation (7) then gives 




n 


n =0 

0?iT(i-v/a)-l] 


(9) 
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On substituting from Eq. (9) into Eq, (4) we obtain for p(W) the formula 

p( W) = e aW e"' 7 — f” e -tnw-«ta) d t . (10) 

2 7r J *’* 

The integral on the right of Eq. (10) is just 2tt8(W - n/a), where 6 is the Dirac delta func¬ 
tion. Hence 


p(W) = 8(W - W). 


( 11 ) 


where 


i7> n 

W = -. (12) 

a 

This result implies that when the observed photocount distribution is strictly Poissonian, 
i.e. given by Eq. (8), which represents pure shot noise, the intensity of the incident light is 
completely stabilized in the sense that it does not fluctuate at all and W has then the con¬ 
stant value W = n/a. 


PROBLEMS 

7.1 V(r, t) is an analytic signal representation of a linearly polarized, stationary, thermal 
field and 

A/(r, t) = /(r, t) - {Hr, t)) 

represents the fluctuation of the instantaneous intensity of the field at the point r. Show 
that one may determine the absolute value of the degree of coherence 2 <r b r 2 , r) of the 
light at two points in the field from knowledge of the averaged intensities and of the 
correlation between the intensity fluctuations at the two points. 

7.2 (£) represents the average energy in the frequency range (v, v + dv) of blackbody radia¬ 
tion, at temperature T , contained in a volume V. It is known that 

(i) when hv/(k B T) < 1, IE) is given by the Rayleigh-Jeans law' 

(E) = Zk B T- 

(ii) when hv/(k B T) > 1, (E) is given by Wien's law 

(E) = Zhve 

where 


Z = 


du 
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(a) Show that the variance of the energy fluctuations is given by the expressions 


<(A£)2) 


t{E) 2 when hv/(k B T) <4 1, 

hv(E) when hv/(k B T) 8> 1. 


(b) Assuming that the fluctuations in these two limiting cases are due to two different 
causes and that, in general, both are effective and give rise to fluctuations whose 
variance is the sum of the two variances stated in (a), determine, with the help of 
the Einstein-Fowler formula, the general expression for ( E). 

7.3 (a) Determine the mean and the variance of the probability distribution 


pM) 


p" 

(i + py ,+1 


where p is a positive constant and the random variable n takes on all non-negative 
integer values. 

(b) The probability distribution for the number of photons of energy hv in a mode of 
blackbody radiation at temperature T may be shown to be given by the expression 

p 2 ( n ) = (1 - e“ /,y/W 'B r) ) e -nhv/(k B T)^ 

where k H is the Boltzmann constant. Show that p 2 (n) may be expressed in the form 
P\(n) with an appropriate choice of the parameter p. 

Determine also the variance of p 2 (n) in the two limiting cases when 


hv/(k B T) <$ 1 and hv/(k B T) > 1. 


7.4 The time-dependent intensity of a single-mode amplitude-stabilized laser beam, inci¬ 
dent normally on a photodetector, has the form 

/(f) = i/ 0 [l + cos(« 0 f + 6>)], 

where 1 0 and w 0 are constants and 9 is a random variable which is uniformly distrib¬ 
uted on the interval (0, n). Determine the mean and the variance of the number of 
photoelectrons emitted by the photodetector in a time interval of duration T. 

7.5 A laser beam of constant average intensity is incident normally on the surface of a pho¬ 
todetector. What is the probability p(n ) that n photoelectrons will be emitted from the 
detector in the time interval (t, t + T)? Comment on the physical significance of the 
result. 
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7.6 A stochastic electromagnetic beam is incident normally on a photoelectric detector. It 
is found that the probability of n photoelectrons being emitted in a time interval (/, t + T) 
is the Bose-Einstein distribution for a cell in phase-space, viz.. 


Pin) = 


n” 

(n + l) n+1 


Determine the probability p(W) of the integrated intensity 

nt+T 

w = J Ht'W 


of the incident light. 

What is the corresponding result when Pin) is the Poisson distribution 





8 


Elementaty theory of polarization of 
stochastic electromagnetic beams 


Up to now we have simplified our discussion of optical fields by treating them as scalar 
fields, i.e. we have ignored their polarization properties which arise from the vector nature 
of the electromagnetic field. In this and the next chapter we will discuss the extension of 
the theory by taking the vector nature into account for a broad and a useful class of vectors 
fields, namely stochastic electromagnetic beams. 


8.1 The 2x2 equal-time correlation matrix of a quasi-monochromatic 

electromagnetic beam 

Let us consider an electromagnetic beam, i.e. an electromagnetic field which propagates 
close to a particular direction, which we will take to be the z direction (see Fig. 8.1). We 
will assume that the fluctuations of the electric and the magnetic field vectors are charac¬ 
terized by ensembles which are statistically stationary, at least in the wide sense. Let EJt) 
and E y (t) be the components, represented by complex analytic signals (see Section 2.3), of 
the electric field at some point P in the beam, in two mutually orthogonal directions, per¬ 
pendicular to the z direction. 


E x (t) 
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Fig. 8.1 Notation relating to propagation of an electromagnetic beam close to the 
Z direction. 
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Assuming that the beam is quasi-monochromatic and of mean frequency to. we may express 
E x and E y in the form 

E t (t) = fl,(t)e i[ ^^“ 1 , E y (t) = (1) 

As we learned in Section 2.3 in connection with the envelope representation of quasi- 
monochromatic signals, the amplitudes n,(r) and a 2 (t) and the phase functions <p\(t) and 
<p 2 (t) change slowly with time over intervals which are short relative to the coherence time 
t c ~ 2tt/Aw, Aw being the effective bandw idth of the light. 

The second-order correlation properties of the electric field at a point O may be charac¬ 
terized by a 2 X 2 correlation matrix 

J - - , ( 2 ) 

[(El(t)E x (t)) (E* y W f (t))\ 

the asterisk denoting the complex conjugate. The matrix J is called the polarization matrix. 
(A less appropriate name, “coherency matrix.” is used in older literature.) This is an equal- 
time correlation matrix. Its diagonal elements are averages of the intensities associated with 
each of the two components and its off-diagonal elements are analogous to the mutual 
intensity J(rp r 2 ) of a scalar field, which we encountered in Section 3.1. However, whilst 
the mutual intensity of the scalar field V(r, t) is a measure of correlations, at the same 
instant of time, at two points r, and r 2 , the off-diagonal elements of this matrix are mea¬ 
sures of the "equal-time” correlations between the mutually orthogonal components E x and 
E y of the electric field at a particular point. 

We saw in Section 3.1 that the mutual intensity, and hence the diagonal elements of the 
matrix, may be determined by means of Young’s interference experiment. We w ill now 
show that also the off-diagonal elements of the polarization matrix J may be determined by 
relatively simple interference experiments. 

Suppose that the beam is passed through a compensator and then through a polarizer 
(Fig. 8.2). 

Let ej and e 2 be the phase changes introduced in the components E x (t) and E y (t) by the 
compensator as the beam propagates from a plane z = c 0 to a plane c = - : and let 9 denote 



Fig. 8.2 Transmission of a quasi-monochromatic beam through a system consisting of 
a compensator and a polarizer. 
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Polarization of stochastic electromagnetic beams 


the angle which the directions of vibrations of the electric vector emerging from the polar¬ 
izer makes with the x-axis. Apart from an unessential constant phase factor and ignoring 
reflection losses at the surfaces of the compensator and the polarizer, the linearly polarized 
electric field vector which emerges from the system is given by the expression 

= [£' v (t)e 1 h cos# + E y (t)e ie - sin#]i fl , (3) 

where ig is the unit vector in the direction of the linearly polarized electric vector, i.e. the 
two-dimensional unit vector with components (cos #, sin#). 

The averaged intensity, or, more precisely, the expectation value of the electric energy 
density of the light transmitted by the system, is given, in suitable units, by the formula 

/(£[, £ 2 , #) = (E* (/, erj, e 2 , 0) ■ E(f, Ei, e 2 , 0)j, (4) 

apart from a proportionality constant which depends on the choice of units. On substituting 
from Eq. (3) into Eq. (4) we find that 

I(e i, e 2 , 0) = 1(6, 9) 

= J xx cos 2 # + J yy sin 2 # + / vv e' 4 sin #cos 9 + J yK c ’*cos 9 sin #, (5) 

where 

6 = s 2 - (6) 

and 

J xx = (E* x (t)E x (t)), J x> = {E* x (t)E y (t% 

J yx = KmM j yy = (E;w y ( t )}, ( 7 ) 


are the elements of the polarization matrix J. We note that 


J 


V.V 



( 8 ) 


i.e. the matrix is Hermitian. Moreover, it is also non-negative definite , i.e., with any (real or 
complex) numbers Cj and c 2 , 

C l C l^xt + C 2 C 2^yy ^ C \ C 2^xy C l C 2^yx ~ (9) 


This result follows at once from the obvious fact that |c,E v + c 2 E y | 2 ) > 0. From this inequal¬ 
ity or by applying the Schwarz inequality to the term /„ = 1 E* x (t)E y (t)) and making use of the 
Hermiticity condition (8) one readily finds that 


i-g 


(10) 
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We may characterize the correlation between the x and y components of the electric field 
by the correlation coefficient 


J 



J 


(ID 


The inequality (10) then implies that 


0 — Un>l — 1- 


( 12 ) 


The extreme value \j„\ = 1 represents complete correlation; the other extreme value, 
\j xy \ = 0. represents complete absence of correlation between the fluctuating x and y com¬ 
ponents of the electric field vector. 

If one measures the average intensity with various phase delays 6 and angles 9 of orienta¬ 
tion by the use of phase plates and polarizers one obtains, using Eq. (5), a set of linear equa¬ 
tions from which the four elements of the 2 X 2 polarization matrix J can be determined. 1 

Instead of characterizing the properties of the fluctuating electric field by the 2 X 2 
polarization matrix J. one frequently uses an older representation, in terms of Stokes 
parameters They may be defined in terms of averages involving the amplitudes and the 
phases of the x and y components of the complex electric field by the formulas 


s 0 = (ci~U)) + (m;(0), 

■*i = («i 2 ( r )) - (ajU)), 

s 2 = 2{s 1 Cf)fl 2 (r)c:os[(£ 1 (/) - (Mt)]), 

% = 2(8,00 a 2 (f)sin[0,(/) - 4> 2 {t)]). 


(13) 


It follows from Eqs. (13), (7) and (1) that the Stokes parameters and the elements of the 
polarization matrix J are related by the formulas 



(14a) 


and 



(14b) 


Expressions for the four elements in terms of a particularly convenient choice of the parameters 6 and 6 are 
given in B&W', p. 621. 
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One can readily derive formulas that describe the changes which the polarization matrix 
J or the so-called Stokes vector s (% s b s 2 , $ 3 ) undergoes as the beam passes through 
various linear non-imaging devices such as polarizers, compensators and rotators. In the 
correlation-matrix formalism they are represented by 2 X 2 “transmission” matrices. In 
the formalism which uses the Stokes parameters they are represented by 4 X 4 matrices 
known as Mueller matrices. Since this topic is treated in many publications, 1 we will not 
discuss it here. 

When the electromagnetic beam is monochromatic, the Stokes parameters are particu¬ 
larly useful for representing certain geometrical properties of the oscillating field vector, as 
we will see in Section 8.2.4. 


8.2 Polarized, unpolarized and partially polarized light. 

The degree of polarization 

Of special interest are fields for which the modulus of the degree of correlation y' defined 
by Eq. (11) of Section 8.1, takes on one of the extreme values, either \j xy \ = 1 or = 0. 
We will now consider these two cases. 


Suppose that 


8.2.1 Completely polarized light 


iA-yl = 1 - ( 1 ) 

In this case the x and the y components of the electric field are completely correlated. It fol¬ 
lows at once from Eqs. (11) and (8) of Section 8.1 that then 

= 0 . ( 2 ) 

Conversely, the vanishing of the determinant of the polarization matrix may readily be seen 
to imply Eq. (1), i.e. it implies complete correlation between the x and the y components of 
the electric vector. 

It is known from elementary theory of matrices that the determinant of the polarization 
matrix J is invariant with respect to rotation of the jr- and the v-axes about the direction of 
propagation of the beam. Hence if the x and the y components of the electric field are com¬ 
pletely correlated for a particular set of coordinate axes they will also be completely corre¬ 
lated for any such pair of axes. 


See, for example. G. E. Parrent and P. Roman. Nuovo Cimento 15 (1960), 370-388; E. L. O'Neill, Introduction 
to Statistical Optics (Addison-Wesley, Reading, MA (1963); reprinted by Dover, New York. 2004); and 
E. Collett. Polarized Light (Marcel Dekker, New York, 1993). Chapter 5. 
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From Eq. (1) and from Eq. (11) of Section 8.1. making use of the Hertniticity of the polar¬ 
ization matrix expressed by Eq. (8) of Section 8.1, it follows that 


J = 




(3) 


where o is real. 

Equation (1), or. equivalently, the statement that E x and E y are completely correlated, 
characterizes light which is said to be completely polarized at the point P(r). This terminol¬ 
ogy is used because of a formal analogy between this situation and the behavior of a (nec¬ 
essarily deterministic) monochromatic field. Such fields will be discussed in Section 8.2.4. 
Here we only explain the reason for the analogy. 

A monochromatic electromagnetic beam is, at every point, completely polarized in the 
sense that with increasing time the end point of its electric field vector moves on an ellipse 
(see B&W, Section 1.4.3). One says that the field is ellipticallypolarized. In some cases the 
ellipse may, of course, reduce to a circle or a straight line, in which cases we speak of cir¬ 
cular and linear polarization, respectively. 

Consider a monochromatic beam propagating close to the positive z direction Let 

E x (z, t) = ce^"'*'". EJz, t) = e 2 e i(fc - wr) (4) 


(k - w/c, c being the speed of light in vacuum) be the components of the (complex) elec¬ 
tric vector at an arbitrary point, along two mutually orthogonal directions, specified by unit 
vectors ei and e 2 , perpendicular to the direction of propagation. One can associate with this 
monochromatic wave a 2 X 2 matrix which is somewhat analogous to the polarization 
matrix [Eq. (2) of Section 8.1] of a random and, therefore, necessarily not monochromatic 
field, namely the matrix 


E*E EE 

X X X V 

/ . / •: e: / 

V X V V 


(5a) 


e Z e i e 2 e 2 


(5b) 


It is to be noted that the elements of this matrix do not involve any averaging. 

It is evident that the determinant of this matrix has the value zero, just as is the case when 
the polarization matrix represents a completely polarized beam [see Eq. (2)]. This result 
implies a certain equivalence theorem, namely that the “canonical” expenment which employs 
only a compensator and a polarizer for determining the elements of the polarization matrix 
J cannot distinguish between a random quasi-monochromatic beam whose polarization 
matrix has the form given by Eq. (3) and a strictly monochromatic beam whose polariza¬ 
tion matrix has the form (5b), with 


e 


i 



(6) 


where a ] and o 2 are real arbitrary constants. 
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8.2.2 Natural (unpolarized) light 


We now consider the other extreme case, namely when the degree of correlation takes on 
the value zero, i.e. when 


(7) 


irrespective of the particular choice of the x- and y-axes. Light with these properties is said 
to be natural light, because it very frequently occurs in nature, for example light reaching 
the Earth from most astronomical sources. For reasons that will be explained shortly, such 
light may also be said to be unpolarized. 

Let us consider some implications of Eq. (7). For this purpose let us first examine how 
the matrix J changes when the axes are rotated by an angle, say ©, in the anticlockwise 
sense around the direction of propagation of the beam. If E x -, Ey are the components of E 
referred to the rotated coordinate system Oy, Oy (see Fig. 8.3) one has 


E x , = E x cos 0 + E v sin 0, 

E • = — E sin 0 + £ cos 0. 
y x y 


( 8 ) 


The elements of the polarization matrix, referred to the rotated coordinate system, are 
•4ff = (El'Ey) and, using Eq. (8), we see that the polarization matrix in the new coordinate 
system is 


J c 2 + J s 2 + (J + J )cs (J — J )cs + J C 2 — J s 2 

.or yy ' x}' yx ’ ' yy xx / xy yx 

(J — J )cs + J c 2 — J s 2 J s 2 + J c 2 — (J + J )cs 

v yy .XX f yx xy xx yy v xy >x J 


where c = cos 0 and s = sin 9. 

According to the definition of the correlation coefficient [Eq. (11) of Section 8.1] and 
using the Hermiticity condition [Eq. (8) of that section], the requirement (7) implies that 
for natural light 



( 10 ) 



O 


Fig. 8.3 Notation relating to the change in the correlation matrix J when the axes are 
rotated about the direction of propagation of the beam. 
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irrespective of the particular choice of the x'- and v'-axes. In view of the transformation law 
given by Eq. (9), Eq. (10) implies that with any choice of 6 the term (J - ,/ xv )cos # sin 6 = 0, 
i.e. that with any choice of the X- and y-axes 

J yy = J xx . (11) 

Using Eqs. (10) and (11) it follows that the polarization matrix of natural light has the form 


i.e. J is proportional to the unit matrix. The proportionality factor J :x is also independent 
of the choice of the axes, because, w'hen Eqs. (10) and (11) hold, the transformed polariza¬ 
tion matrix is independent of 6. For reasons that will become apparent shortly such a beam 
is also said to be unpolarized. 

Let us consider the transmission of natural light through the system showm in Fig. 8.2, 
consisting of a compensator, which introduces a phase delay 6, and a polarizer, which trans¬ 
mits the component of the electric field vector which makes the angle 0 with the .v-axis. The 
intensity 1(6, 8) = /(e,, e 2j @) of the light emerging from such a system is given by Eq. (5) 
of Section 8.1, viz., 

1(6, 8) = J xx cos 2 6 + J yy sin 2 0 + T^.e^sin 9 cos 6 + J yx z ^ cos Osin 8. (13) 

On substituting from Eqs. (10) and (11) into this formula we obtain for the intensity of the 
transmitted beam the expression 

1(6, 9) = J xx , (14) 

i.e. only half of the intensity of the incident wave is transmitted. The formula shows that, 
w'hen a beam of natural light is transmitted by a compensator and a polarizer, the intensity 
of the light emerging from this system is unaffected both by the retardation introduced by 
the compensator and by the orientation of the polarizer. 


8.2.3 Partially polarized light and the degree of polarization 

We considered tw'o extreme situations: a light beam for w'hich the absolute value of the cor¬ 
relation coefficient j has the extreme value unity and a light beam for which it has the 
other extreme value, zero. We will now show' that the polarization matrix of any light beam 
may be, at each point, expressed uniquely as the sum of these tw'o kinds of beams. 

The ratio of the intensity of the polarized part to the total intensity is called the degree of 
polarization. The nature of the polarized portion (i.e. the length of the principal axes of its 
polarization ellipse and the orientation of the ellipse) together with the degree of polariza¬ 
tion are said to represent the state of polarization of the beam. It generally changes as the 
beam propagates 1 or is scattered, for example, by a solid body or by a system of particles. 

That the degree of polarization may change on propagation, even in free space, appears to have been first 
demonstrated by D. F. V. James, J. Opt. Soc. Amer All (1994), 1641-1643. 
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Such changes are often of considerable practical interest because they provide information 
about the physical system that interacts with the beam. 

Let us then consider the possibility of expressing the polarization matrix of a beam prop¬ 
agating close to the z direction in the form 

J = jW + J‘u) f (!5J 

where the two matrices on the right represent completely polarized light (superscript p) and 
completely unpolarized light (superscript u). According to Eqs. (13) and (12) the matrices 
are of the form 


where 


and 


J»p> 


B D 

D* C 



0 

I 


A > 0, B > 0, C > 0, 


( 16 ) 


(17) 


BC — DD* — 0. (18) 

With J k( , (k = x, y, ( = x, y), representing the elements of the polarization matrix J, Eqs. 
(15) and (16) imply that 


A + B = J , D = J , 

XX .TV ’ 

D = J , A + C =- J . 

y.v' yy 


(19) 


On substituting for B, C and D from Eq. (19) into Eq. (18) we obtain the following equa¬ 
tion for the matrix element A: 

(/„ - A)(7 VV - A) - J x J yx = 0. (20) 

This equation shows that A is an eigenvalue of the polarization matrix J. A simple calcula¬ 
tion shows that the solution to this equation may be expressed in the form 

A = ^jlr J ± [(Tr J) 2 - 4 Det J]i j, (21) 

where Tr denotes the trace and Det the determinant. Since according to Eqs. (8) and (9) of 
Section 8.1 the polarization matrix is Hermitian and non-negative definite, both eigenval¬ 
ues A are necessarily non-negative, as may also be verified by direct calculation. 

Let us consider first the root A given by Eq. (21). with the negative sign in front of the 
square root: 

A = ' l / «, + V - 1 v'A, -4DctJ. 


(22a) 
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On substituting this root into the diagonal terms in Eqs. (19) we obtain the following 
expressions for the matrix elements B and C: 


B = -(/ 

■j v XX 

- V + J ) 2 - 4 Det J, 

(22b) 

C = -(J 

2 »■ 

- J. v ) + y Ur J) 2 - 4 Det J. 

(22c) 


If we make use of the Hermiticity relation J yx = ,/* [Eq. (8) of Section 8.1 ] we find at 
once that 


(dr J> 2 - 4 Det j]* = p xx - JJ 2 + 4jJ/ > - /J. (23) 

Consequently the matrix elements B and C given by Eqs. (22) are necessarily non-negative 
as required by two of the inequalities in (17). On the other hand, the other expression for A 
given by Eq. (21), with the positive sign in front of the square root, may readily be showm to 
yield negative values for B and C and, therefore, does not satisfy the inequalities (17). Thus 
we have shown that there is a unique decomposition of the polarization matrix J in the form 
given by Eq. (15), subject to the constraints given by Eqs. (17) and (18). This result implies 
that any statistically stationary 1 light beam may, at each point, be regarded as being the sum 
of two beams, one of which is completely polarized and the other completely unpolarized. 

It follow's from the expression for the polarization matrix J lp) of the polarized part [the 
first matrix in Eq. (16)] and from Eqs. (22b) and (22c) that its trace (B + C) is given by the 
expression 

Tr J (p) = yj (Tr J) 2 - 4 Det J. (24) 

The trace of the polarization matrix J (u) of the unpolarized part is, according to Eqs. (16) and 
(21), given by the formula 

Tr J' u) = Tr J - ^(Tr J) 2 - 4 Det J, (25) 

where w e used the fact demonstrated earlier that only the expression for A with the nega¬ 
tive sign in front of the square-root sign is admissible. Now, according to the definition of 
the polarization matrix [Eq. (2) of Section 8.1], the trace of that matrix is proportional to 
the average electric energy density and may, therefore, be regarded as a measure of the 
intensity, say /, of the light. Using Eqs. (24) and (25) it follows that the degree of polariza¬ 
tion of light, defined as the ratio of the intensity of the polarized portion / lpl to the total 
intensity I at the point under consideration, is given by the expression 
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Since both the determinant and the trace of the correlation matrix are invariants under the 
rotation of the x- and y-axes about the z direction, the degree of polarization ¥ is independent 
of any particular choice of the x- and y-axes. Also, in view of the inequality (23) one can read¬ 
ily deduce from Eq. (26) that 


Ospsl. 


(27) 


We see at once, on using Eq. (2) in Eq. (26), that, for light that we previously called com¬ 
pletely polarized light, the degree of polarization ¥, given by Eq. (26), does indeed have the 
extreme value ¥ = 1. 

For natural light we have, from Eq. (12), Tr J = 2 J xx and Det J = J 2 XX , so the degree of 
polarization given by Eq. (26) has, in this case, the other extreme value T = 0, i.e. it may 
be said to be unpolarized. 

When the degree of polarization has a value between these tw'o extremes, i.e. w'hen 
0 < ¥ < 1, we say that the light is partially polarized. 

As we saw earlier the polarization matrix is Hermitian [Eq (8) of Section 8.1]. Hence by 
application of a well-known theorem about such matrices 1 it can be diagonalized by a uni¬ 
tary transformation (which, however, need not be a rotation). It will then have the form 


J' = 


A, 

0 


0 

A, 


(28) 


A[ and A 2 being the eigenvalues of the polarization matrix. Moreover, because the polariza¬ 
tion matrix is also non-negative definite [Eq. (9) of Section 8.1] the eigenvalues are necessar¬ 
ily non-negative. Since the trace and the determinant of a matrix are invariant under rotation 
of the r- and y-axes about the z direction," the determinant of the original polarization matrix 
J and the determinant of its diagonalized matrix [Eq. ( 28)] must be equal to each other and 
the same is true of the traces. Hence 

Det J = AjAo, (29a) 

Tr J = A, + A 2 , (29b) 

and the degree of polarization, given by Eq. (26), may, therefore, be expressed in the sim¬ 
ple form 


<p = 


4A,A, 

(A, t- A,) 2 


(30a) 


i.e. 


T = 


\ ^21 

At + Aj 


(30b) 


1 F. W. Byron and R. W. Fuller, Mathematics of Classical and Quantum Physics (Addison-Wesley, Reading, MA. 
1969); reprinted by Dover. New York. 1992, Vol. ], p. 165. Theorem 4.20. 

2 See, for example, F. W. Byron and R. W. Fuller, loc. cit.. p. 119, Theorem .413. 
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Since the eigenvalues are independent of the choice of the x- and y-axes it is clear that the 
degree of polarization T is also independent of that choice, as, indeed, must be the case if 
T is to have an unambiguous physical meaning. 


8.2.4 The geometrical significance of complete polarization. The Stokes 
parameters of completely polarized light. The Poincare sphere 


We saw in Section 8.2.1 that completely polarized light is characterized by the property 
that the x and y components of the electric field vector are completely correlated and that, 
consequently, the polarization matrix is expressible in the form [Eq. (5b) of Section 8.2.1] 


(31) 


where e t and e 2 are independent of time. We have also seen that such a matrix is indistinguish¬ 
able from the polarization matrix of a monochromatic plane wave with w = a> and k — k, 

Efz, t) = e ] e i ' kz ~ u,, \ Efz , t) = e 2 e i( ' fe '“" ) - (32) 

We will now consider some geometrical implications of Eqs. (32). 

Equations (32) represent the Cartesian components of the electric field vector in com¬ 
plex form. The physically meaningful quantities are their real parts, i.e. 

= |e,|cos(a, + kz ~ cot), (33a) 

Eyfz, t) = |e 2 |cos(o- 2 + kz ~ wt), (33b) 


where a, and a 2 are the phases of e\ and e 2 , respectively. 
It will be convenient to set 


= lejcosta, + kz). 

q x = hlsinla, + kz). 

(34a) 

= |e ; |cos(a 2 + kz), 

q , = k 2 | sin(o 2 + kz). 

(34b) 


By expanding the cosine terms on the right-hand sides of Eqs. (33) and (34) by the use of 
elementary trigonometric identities, the components of the electric field may be expressed 
in the form 

E { fHt) = p x cos(iot) + q x sin(w?), (35a) 

E [ P(t) = p y cos(<at) + q y sm{wt), (35b) 

where we suppressed the explicit dependences of the various quantities on z. On combin¬ 
ing the tw'o scalar formulas (35) into a simple vector formula we have 

E (r) (f) = pcos(wt) + qsin(wf), (36) 


where p = ( p x , pfi and q = Up. qf. 
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Equation (36) shows that, with increasing time, the end point of the electric field vector 
E (r) (r) at a fixed point in space describes a curve in the plane containing the two real vec¬ 
tors p and q and passes through their end points. Because cos(wr) and sin(wt) are periodic 
functions of time t. the curve is evidently closed. We will now show that, in general, the 
curve is an ellipse. To see this let us rewrite Eq. (36) as 

E«(r) = He{( p + iqle - '"'}. (37) 

where He again denotes the real part. Let us set 

(p + iq) = (a + ibje'C (38) 

where e is a scalar parameter. In terms of p, q and £ one evidently has 

a = p cos e + q sin e, (39a) 

b = — p sin e + q cos e. (39b) 

Let us now choose the parameter e so that the vectors a and b are mutually orthogonal with 
|aj ^ |b|. In that case £ must evidently satisfy the equation 

(p cos e 4 q sin e) ■ (— p sin £ + q cos e) = 0. (40) 

From this equation it readily follows that 

tan<2£) = ‘ q , . (41) 

p- - q- 

We may take as the parameters which specify the behavior of the electric field at any par¬ 
ticular point the time-independent components of the mutually orthogonal vectors a and b 
and the associated parameter e, rather than the six Cartesian components of the vectors 
p and q. From Eqs. (37) and (38) it then follows that 

E (r, (f) = a cos (lot — £) + b sin(w? — £). (42) 

If we take Cartesian axes with the origin at the point where the electric field is being 
considered and with the A" and y directions along the vectors a and b. respectively, we evi¬ 
dently have 


£h) = a C0S ( W f — £(r) = £> s in( w r — e), £ = o (43) 

(a = lal, b = lb!). These equations represent an ellipse in the x, y-plane. known as the polariza¬ 
tion ellipse , 



a 


2 


+ 



b 2 


= 1 , 


(44) 
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with the semi-axes a and b along the x and v coordinate axes. By elementary geometry it 
may be shown that p and q are a pair of conjugate semi-diameters of the ellipse. 1 

The ellipse may be traversed in one of two possible senses. In conformity with frequently 
used terminology we say that the polarization is right-handed when, to an observer look¬ 
ing in the direction from which the light is arriving, the end point of the electric field vec¬ 
tor would appear to traverse the ellipse in the clockwise sense. In the opposite case we say 
that the polarization is left-handed. These two situations are distinguished by the sign of the 
scalar triple product [a, b, Ve] = [p, q. Vet]. 

We may readily determine the setni-axes a and b of the polarization ellipse in terms of 
the original quantities p and q and the parameter e, given by Eq. (41). We have from (39a) 

a 2 = p 2 cos 2 j + q sin 2 ^ + 2p • q cos e sin <•: 

= + T) + ~f (P 2 ~ <? 2 )cos(2e) + p • qsin(2t). (45) 

From Eq. (41) it follows that 

sin(2e) = -? = q =. cos(2e) = , P q - =. (46) 

yj(P 2 “ q 2 ) 2 + 4(p • q) 2 V( P 2 ~ <? 2 ) 2 + 4(p • q) 2 


Hence 


P 2 + q 2 + \I(P 2 - q 2 ) 2 + 4(p • q) 2 


(47a) 


Similarly, one finds that 


b 2 


p 2 + q 1 - ^j(p 2 ~ q 2 ) 2 + 4(p • q) 2 


(47b) 


To determine the angle between a and p we express the equation of the polarization 
ellipse in a parametric form, 

= acostfr, = bamcp, (48) 


where <p is the so-called eccentricity angle (see Fig. 8.4). According to elementary geome¬ 
try the angle <f> is related to the polar angle 0 of the point (Ef \ E'f ) by the formula 

n & 

tan 0 = — tan <p. (49) 

a 


1 For discussion of conjugate diameters see, for example. A. Robinson, An Introduction to Analytical Geometry 
(Cambridge University Press, Cambridge. 1940), Vol. I. Section 14.7. 
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v 



Fig. 8.4 Illustrating the meaning of the eccentricity angle <p of a point (E^\ on the 
polarization ellipse. 


Comparison of Eqs. (43) and (48) shows that in the present case <p = wt — e. Now, accord¬ 
ing to Eq. (36), E {r \t) = p when t = 0, so that the eccentricity angle of p is —e. Equation 
(49) then implies that the angle ip between a and p is given by the formula 

b 

tan 'ip = — tan g« (50) 

a 

Let us introduce an auxiliary angle j3 defined by the equation 

— = tan 0, (51) 

P 

It then follows from the formula (41) and from elementary trigonometric identities that 

tan(2c) = tan(2i9)cos 7 , (52) 

where 7 is the angle between the vectors p and q. 

Let us summarize the results which we have just derived. If the real vectors p and q are 
given [see Eq. (36)], and if 7 denotes the angle between them and /3 denotes the auxiliary 
angle defined by Eq. (51), then the semi-axes of the ellipse and the angle tp which its major 
axis makes with the vector p are given by the formulas (47) and (50) (see also Fig. 8.5), 
where S' is given by Eqs. (52) and (41). 

Two special cases are of interest, namely when the ellipse degenerates into a circle and 
when it degenerates into a straight line. In the first case one says that the electric field at the 
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v 



Fig. 8.5 Illustrating the significance of various parameters used for specifying the polar¬ 
ization ellipse and some useful relations between them: 

E' r, (t) = pcos(o)t) + qsin(wt) 

= acos(oil — e) + bsin(aif - e), 
b 

tan i. > = — tan 5, 
a 

tan P = —, 

P 

tan(2e) = ———— = tan(2fl)cos j. 
p 2 - q- 


point under consideration is circularly polarized. Then a and b and, consequently, also £ 
are undetermined. According to Eq. (41), one then has 

p • q = p 2 - q 2 = o. (53) 

When the ellipse degenerates into a straight line, i.e. when the wave is linearly polarized , 
there is no minor axis ( lr — 0) and the formula (47b) then gives 

<p • q ) 2 = p 2 q 2 . (54) 

We briefly mentioned towards the end of Section 8.1 an older representation of electro¬ 
magnetic beams in terms of the so-called Stokes parameters [Eqs. (13) of that section]. In 
the special case when the beam is completely polarized the Stokes parameters are given by 
the simple expressions 


S„ = K,l 2 + l e 2l 2 , 
= 1*11 2 - l%I 2 , 
s 2 = 2 l e 'ill t ’ 2 lcos 0 , 
s 3 = 2 ! e ill e 2 lsin 6, 


( 55 ) 
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where 


b = a- 2 - Q| (56) 

is the difference between the phases a \ and a 2 of the (generally complex) quantities e, and 
e 2 . These four Stokes parameters are not independent but are related by the identity 

s a - sf +s- + Sf, (57) 


as follows at once from Eqs. (55), 

The parameter s 0 is proportional to the intensity of the beam. The other three Stokes 
parameters can be shown to be related in a simple way to the angle tf’(0 < it ' n) which 
specifies the orientation of the polarization ellipse and to the angle \( — 7t/2 tt/2) 

which characterizes the ellipticity and the sense in which the ellipse is traversed. The fol¬ 
lowing expressions for the Stokes parameters S|, ,y 2 , s 3 hold: 


5, = so cos(2x)cos(2-0). 

(58a) 

s 2 = s 0 cos(2x)sin(2i/>X 

(58b) 

j 3 = s 0 sin(2\). 

(58c) 


The derivation of these relations is somewhat lengthy. It is given elsewhere (see, for exam¬ 
ple. B&W, Section 1.4.2). 

Equations (58) indicate a simple geometrical representation of all possible states of a 
polarized field. Evidently the three Stokes parameters [Eq. (58)] may be regarded as 
Cartesian coordinates of a point P on a sphere Y of radius j 0 , with 2\ and 2w being the 
spherical angular coordinates of that point (see Fig. 8.6). Hence to every' possible state of a 
fully polarized beam of intensity s 0 at an arbitrary point there corresponds a point P on the 
sphere X] (known as the Poincare sphere) and vice versa. Since \ is positive or negative, 
respectively, according to whether the polarization is right-handed or left-handed, it fol¬ 
lows from Eq. (58c) that a right-handed polarized state is represented by points on Y which 
lie above the equatorial plane (the x, y-plane) and left-handed polarized states are repre¬ 
sented by points which lie below that plane. Further, for linearly polarized light the phase 
difference 6 defined by Eq. (56) is zero or is an integral multiple of n. According to the 
fourth equation of the expressions in Eq. (55), the Stokes parameter s 3 is then zero. Hence, 
linear polarization is represented by points in the equatorial plane. For circular polarization 
|ei| = \e 2 \ and 6 = tt/2 or -tt/2. according to whether the polarization is right or left- 
handed; hence right-handed polarization is represented by the north pole (5, = s 2 = 0, 
s’3 = s Q ) and left-handed circular polarization by the south pole (j 3 = s 2 = 0, s 3 = -s 0 ). 
This geometrical representation of different states of completely polarized light by points 
on a sphere is due to Poincare. It is particularly useful in crystal optics for determining the 
changes in the state of polarization of light which traverses a crystalline medium. 
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Fig. 8.6 The Poincare sphere. Every point P on this sphere, of radius s n , with Cartesian 
coordinates given by the Stokes parameters jj, s 2 , s 2 , represents the state of polarization 
at a point in space. The angles 2\ and 2ip are the polar angles specifying the direction 
of the vector OP [see Eqs. (58)]. Points above the equatorial plane represent right- 
handed polarization; points below it represent left-handed polarization. Linear polariza¬ 
tion is represented by points in the equatorial plane, circular polarizations by the north 
pole and the south pole. 


PROBLEMS 


8.1 (a) Show that the absolute value of the correlation coefficient 


./ 



J 


where J yy and J xy are elements of the polarization matrix, does not exceed the 
value of the degree of polarization. 

(b) Show also that the x- and the y-axes may always be so chosen that \ j xy \ is equal to 
the degree of polarization. 

8.2 Show that the Stokes parameters of a quasi-monochromatic light beam satisfy the 


inequality 



2 


‘3 


0 
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8.3 s 0 , sj, ,v 2 and jr 3 are the Stokes parameters of a monochromatic light beam. Show that 
the expression 


is invariant under rotation of the axes about the direction of propagation of the beam. 


8.4 Derive an expression for the degree of polarization of a quasi-monochromatic light 
beam in terms of the Stokes parameters. 


8.5 In the literature on scattering the following expression for the degree of polarization is 
often used: 

/ v (r) ~ / (r)| 

Pi r) = 

/,(r) + / v (r) 

Here I x and /,. denote the (averaged) intensities of the electric field in two mutually 
orthogonal directions perpendicular to the axis of the beam. 

(a) Show that P is equal to the degree of polarization P, defined rigorously in Section 
8.2.3. if and only if the polarization matrix J is diagonal. 

(b) Determine the conditions under which the polarization matrix of a stochastic elec¬ 
tromagnetic beam may be diagonalized by a rotation of the x- and waxes about the 
direction of propagation of the beam. 

(c) Assuming that the condition for diagonalizing the polarization matrix by rotation 
is satisfied, discuss the meaning of 7 V and I y in the above formula for P. 


8.6 Consider a monochromatic plane wave with Cartesian components E x exp( —iwr), 
E y exp( — \wt) of the complex electric field, in two mutually orthogonal directions per¬ 
pendicular to the direction of propagation. Let 


E = 



E is known as the Jones vector. 

Suppose that the wave passes through a linear non-image-forming device. The Jones 
vector of the emerging wave is then 

E = LE, 

where 


a b 
c d 


is known as the instrument operator (matrix). 
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Determine the instrument operator of the device in each of the following cases. 

(a) A compensator which introduces a phase difference 6 = ip y — tp x between the 
phases and & of the components E x and E y . respectively. 

(b) An absorber which attenuates the x component of the electric field by the multi¬ 
plicative factor e _ct -> and the y component of the electric field by the multiplicative 
factor e~‘V. 

(c) A rotator which produces rotation of the electric field by an angle 9 in the clock¬ 
wise sense around the direction of propagation. 

(d) A polarizer which transmits only the component of the electric field that makes an 
angle 9 with the x direction, measured in the anticlockwise sense around the direc¬ 
tion of propagation. 

8.7 A quasi-monochromatic light beam, characterized by a polarization matrix J passes 

through a linear, non-image forming device. 

(a) Show that the polarization matrix J' of the emergent beam is given by the formula 

J' = l*jl t , 

where L is the instrument matrix (see the previous problem) at the mean wave¬ 
length of the beam, the dagger denotes the Hermitian adjoint and T denotes the 
transpose. 

(b) Show, with the help of the above transformation, that the averaged electric energy 
density of the beam does not change when the beam passes through a compensator 
or a rotator. 

Derive also expressions for the averaged electric energy density of the beam after it has 

passed through (i) an absorber and (ii) a polarizer. 
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Unified theory of polarization 
and coherence 


So far we have treated the subjects of coherence and polarization independently of each other. 
Yet both are manifestations of the same physical phenomenon, namely of correlations 
between fluctuations in light beams. Coherence, we will soon see, arises from correlations 
between fluctuations at two or more points in space. Polarization, on the other hand, is a man¬ 
ifestation of correlation between fluctuating components of the electric field at a single point. 

In recent times, more than 60 years since the publication of Zernike’s basic paper on 
coherence and more than 160 years since Stokes introduced parameters which describe the 
state of polarization of a light beam, a unification of the phenomena of coherence and 
polarization was achieved. 1 In this chapter we will describe this development and we will 
give examples which demonstrate the usefulness of this more comprehensive formulation 
of correlation effects in stochastic electromagnetic beams. 


9,1 The 2x2 cross-spectral density matrix of a 
stochastic electromagnetic beam 

The basic quantity of the unified theory of coherence and polarization of stochastic, statis¬ 
tically stationary, electromagnetic beams is the so-called electric cross-spectral density 
matrix W(r, , r 2 , to), which may be formally introduced as the Fourier transform of the elec¬ 
tric mutual coherence matrix 

(E\(r t)E (r,, t + r)} {E*.(r t)E (tj, t + r)) 

r(r., r,, r) = ' Z * , (1) 

{E' y ( r,, T)E x ( r 2 , t + t)) (E*( r,, t)E y { r 2 , t + t)}| 

where E x and E, are the components of the (complex) electric field vector, represented by 
analytic signals (see Section 2.3) in two mutually orthogonal directions perpendicular to 


1 E. Wolf. Phys. Lett. A 312 (2003). 263-267: Opt . Lett. 28 (2003), 1078-1080; and H. Roychowdhury and 
E. Wolf. Opt. Commun. 226 (2003), 57-60. 
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the axis of the beam (taken to be the g direction). 1 Explicitly one has 


W(r p r 2 , to) = 



( 2 ) 


The matrices T and W are functions of two points, whereas the polarization matrix J of 
Section 8.1 is a function of just one point. This generalization is crucial for elucidating 
many polarization features of a fluctuating electromagnetic beam. In particular, as we will 
soon see, it makes it possible to determine how the degree of polarization may change as 
the beam propagates, v/hether in free space or in a medium. 

We have introduced the cross-spectral density matrix W(r,, r 2 , o>) as the Fourier trans¬ 
form of the mutual coherence matrix r(r b r 2 , r) of the electric field. However, by analogy 
to the important result derived in Section 4.1 for stochastic scalar fields, the cross-spectral 
density matrix may also be expressed as a correlation matrix, i.e. in the form 


W(r,, r 2 , w) = [IF.(r,, r 2 , w)] 


(E*( r, . (o)E x (r 2 , o>)) (£* (ij, <o)E (r 2 , w)) 
{£*(ij, o))E x (r,, &)) (E*{ ij, co)E (r 2 , «)) 


(3) 


O' = x, y; j = x, y). 


where E x ( r, &>) and £ v (r, co) are members of suitably constructed statistical ensembles. 2 

This matrix is particularly useful in formulating the unified theory' of coherence and polar¬ 
ization and for many applications. 

9.2 The spectral interference law, the spectral degree of coherence and the 
spectral degree of polarization of stochastic electromagnetic beams 

By analogy with our earlier discussions of fluctuating scalar wavefields we regard the state 
of coherence of an electromagnetic beam as the ability of the beam to produce fringes of 

1 A related, somewhat less general but nevertheless useful correlation matrix is the so-called beam coherence- 
polarization matrix, which w'as introduced by F. Gori, M. Santersiero, S. Vicalvi, R. Borghi and G. Guattari, 
Pure Appl. Opt. 7 (1988). 941- 951. 

Both matrices are restricted versions of the general 3X3 electric correlation matrix of the electric field (see 
M&W, Section 6.5.1), 

It follows from the envelope representation of quasi-monochromatic signals (Section 2.3) that, 
when |r| «S2-/Ao> (A<u being the effective bandwidth of the light). Hr,. r 2 , r) may be approximated by 
rUh, r 2 . t) *• r(r,. r. 0)exp(— ioir). This approximation corresponds to that given by Eq. (22) of Section 3.1 of 
the scalar theory'. 

: Proof of this result is given in Section 7 of J. Tervo, T. Setala and A. T. Friberg, J. Opt. Soc. Amer. A21 (2005), 
2205-2215. 
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Fig. 9.1 Notation relating to Young's interference experiment with stochastic electro¬ 
magnetic beams. 

appropriate sharpness, in suitable interference experiments. Naturally, we associate a high 
degree of coherence with interference fringes of high visibility and a low degree of coherence 
with fringes of low visibility. The basic experiment which reveals the state of coherence is, 
of course. Young's interference experiment, that we discussed in Sections 3.1 and 4.2 in the 
context of the scalar theory. We will now discuss the experiment by taking into account the 
electromagnetic nature of the beams. Although the analysis will be similar to that which we 
presented for scalar fields, new questions now arise, such as the role which coherence and 
polarization play in propagation of partially coherent electromagnetic beams - a subject 
which is rather subtle and has been clarified only relatively recently. 

Let us again consider a stochastic, statistically stationary, electromagnetic beam which 
propagates close to the z-axis and is incident on an opaque screen A in the plane z = 0 , con¬ 
taining two small openings at points Q\(P\) and (ATpj) (see Fig. 9.1). 

We will determine the distribution of the averaged spectral energy density in a plane % 
placed at some distance beyond the plane jlof the pinholes and parallel to it. 

Let {E(r, «)} represent the statistical ensemble of the electric vector at the point P{ r). A 
typical realization E(r, co) of this ensemble is given, in terms of the realizations E(p|. co) 
and E(p 2 , co) of the electric field vector at the two pinholes, by the formula 

E(r, co) = A’jEfp,, «)e lM i + AT,E(p,, w)e ' kK ~, ( 1 ) 

where R , and R 2 are the distances from the points Q( p,) and Q (p 2 ), respectively, to the point 
P(r), and Aj and K 2 are the same factors as in the scalar case [Eq. (3) of Section 3.1 with A 
replaced by A], again assuming that the angles of incidence and of diffraction at the pin¬ 
holes are sufficiently small. 

Let us now consider the spectral density S( r, w) of the field at the point P( r). We may 
identify the spectral density (the spectrum) with the average electric energy density at that 
point. Apart from an unessential proportionality factor which depends on the choice of 
units, we have 


(2a) 


S(r. co) = (E*(r. co) • E(r. co}) 
= Tr W(r. r. co). 


(2b) 
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where Tr denotes the trace of the cross-spectral density matrix W(r, r, to) which we intro¬ 
duced in the previous section, evaluated for coincident points (rq = r 2 = r), i.e. 

Tr W(r, r, to) = (£*(r, (o)E x (r, to)} + (E*(r, to)E y (r, to)). (3) 


On substituting from Eq. (1) into Eq. (2a) we find that 

S(r, o>) = |^j S(p, co) + \K 2 \ S( p,, co ) 

+ K*K 2 Tr W(p, p,, co)e ik{R 2 - R J 
+ K x K 2 Tr W(p,, p,, a))e~ ik(R 2 - R ^. (4) 

Similarly to the scalar case, this formula may be rewritten in a physically more significant 
form by noting first that, if the pinhole at Q 2 (p 2 ) were closed, then K 2 = 0 and Eq. (4) 
would become S(r, co) = S ll) (r, co), where 

S (1) (r, w) = |K 1 | 2 S(p 1 ,a>). (5) 

S (1) (r, co) evidently represents the spectral density of the field reaching the point P( r) 
through the pinhole Qi(Pi) only. A strictly similar expression is obtained for the spectral 
density, S (2) (r. co), if the light reaches the point P( r) only through the pinhole at Qjifh)- 
Using these expressions and the relation 

TrWip,, p, co) = [TrW(p, p,, w)f, ( 6 ) 


which follows from the definition of the cross-spectral density matrix, we obtain from 
Eq. (4) the following expression for the spectral density at the point P( r): 

S( r, co) = 5 ( 1 ) (i\ w) + r, o>) + 2/s (l) (r. co)^S a) ( r, co) %e[r)( p, p, to)e mR ^ R 0], 

(7) 


where Re denotes the real part and 


t?(Pi, (b. w ) 


_Tr W(p, p,, o>)_ 

^TTW(p|, p, wj^TrWlp,, p,. w) 


Tr W(p, p^, to) 
0))^S( P,, a>) 


If. as is usually the case, S (2) (r, co) **• S (1) (r, co), Eq. (7) reduces to 

S(r, to) = 2S n) (r, «){1 + *Re[i)(s\, p,, «)e ilS ]} 

= 2S (1J (r. a>)|l + |??(p, p 2 , a>)|cos[a(p, p,, to) + <$]}, 


( 8 a) 

( 8 b) 


( 9 ) 
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where a(p|, p 2 . o>) is the argument (phase) of ;/ and 


6 = k(R 2 - R f ) 


( 10 ) 


is the phase difference associated with propagation from ( 2 t(Pi) to /’(r) and from Qiipf ) t0 
P( r), respectively. 

Equation (7) shows that the spectrum at a point P( r) in the observation plane R is not just 
the sum of the spectra of the two beams reaching that point from the two pinholes but 
differs from it by the presence of the third term, which evidently represents the effect of 
interference. We may, therefore, refer to Eq. (7) as the spectral interference law for the 
superposition of stochastic electromagnetic beams. It is of the same form as the spectral 
interference law for scalar wavefields [Eq. (5) of Section 4.2], the only difference being that 
the factor //(p,, p 2 . w) defined by Eq. ( 8 a) now appears in place of the factor p,(p |; p-,. u>) 
defined by Eq. ( 6 ) of Section 4.2. 

It is seen at once from the spectral interference law [Eq. (9)] that, as the path difference 
Ri ~ R i and, consequently, the phase difference 6 changes and the spectral density S(r, w) 
varies sinusoidally between the values 

S max (r. w) = 2 S ( "(r, “){ 1 + |?7(Pr IV w)|} (11a) 

and 

5 min (r - w) = 2 S (ll (r, w ){ 1 - |/Kp,, p,. w)j}. (lib) 


Hence the spectral visibility V(v. at) of the fringes is given by the expression 


‘Ifr at) = ~ <o, 

5 max< r - "1 + 

= Pi^ w )|- 


( 12 ) 


Evidently 

0 < |?;(P|, p,. w)\ < 1, (13) 

and the fringes are sharpest (visibility 1 ) when \rj\ = 1 and there are no fringes at all 
('E= 0) when ij = 0. We may, therefore, identify //(p,, p 2 . w), defined by Eq. ( 8 a), with the 
(generally complex) spectral degree of coherence of the fluctuating electric field at the 
points QfPi) and Q 2 { p 2 ). It can be determined experimentally from visibility measure¬ 
ments. The phase of p can also be determined from measurements, namely by determining 
the location of maxima in the interference pattern, in a manner similar to that in the scalar 
case [see the discussion which follows Eq. (13) of Section 4.2]. It is of interest to note that 
Eq. ( 8 a), which defines the degree of coherence r/(p,. p 2 , w), depends only on the diagonal 
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elements p t , p 2 , to) and W ?< (pp p 2 , co) of the correlation matrix W. The physical reason 
for this fact can be understood by noting that two mutually orthogonal vector components 
t’.x and E v y of the electric vector, with x and y denoting unit vectors along the x and y direc¬ 
tions, do not interfere, because x • y = 0. This is the essence of some of the classic 
Fresnel-Arago interference laws, 1 which were formulated before the electromagnetic the¬ 
ory of light was established. The fact that two mutually orthogonal components of the elec¬ 
tric field do not interfere does not, of course, imply that they are necessarily uncorrelated. 

Although the off-diagonal elements W vv and IT VI do not contribute to the coherence 
properties of a beam, they play a role in specifying its polarization properties; in particular 
their spectral degree of polarization lP(r, o>). This important quantity is defined by a for¬ 
mula strictly analogous to Eq. (26) of Section 8.2.3 with the polarization matrix J(r) 
replaced by the cross-spectral density matrix W(r,, r 2 , w), restricted to the situation where 
r, = r 2 = r, i.e. 



(14) 


Det again denoting the determinant and Tr the trace, i.e. 



Tr W(r, r. w) = W tv (r, r, at) + lV vv (r, r, at). 


(15b) 


Since the degree of polarization is expressed in terms of both the trace and the determinant 
of the correlation matrix W it does indeed depend, in general, not only on the diagonal 
but also on the off-diagonal elements of the matrix. Finally we stress that, whilst the spec¬ 
tral degree of coherence depends on the behavior of the electric field at two points, 
the spectral degree of polarization depends on the behavior of the electric field at a single 
point only. 

9.3 Determination of the cross-spectral density matrix from 

experiments 

We will now show how the elements of the cross-spectral density matrix of the electric field 
of a stochastic beam may be determined experimentally. 

Let us suppose that we again perform Young's interference experiment, with incident 
light that is filtered so that it becomes effectively quasi-monochromatic around the 

1 For accounts of the Fresnel-Arugo interference laws see, for example, E. Collett. Am. J. Phys. 39 (1971), 
1483-1495; E. Collett, Polarized Light Fundamentals and Applications (Marcel Dekker, New York. 1993), 
Chapter 12; or C Brosseau, Fundamentals of Polarized Light (Wiley, New York, 1998). p. 6. 

A derivation of the Fresnel-Arago interference laws which incorporates effects of coherence is given in 
M. Mujat, A. Dogariu and E. Wolf, J. Opt. Soc. Anter. 21 (2004). 2414-2417. 
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frequency to. According to the spectral interference law given by Eq. (7) of Section 9.2, the 
spectral density at the point P( r) may be expressed in the form 


S(r, w) = S'"(r. to) + S (2l (i\ to) 

+ 2^ ( "(r,"^/s ,2) (r, uj | /?(P| , p,. W )| cos[a(p,. p,. at) + dj, (l) 


where S is the phase difference k(R 2 ~ Ri )• In this formula S ' 11 (r) is the spectral density of 
the field which would be observed at P(r) with only the pinhole at Q dp,) opened. S ( 2 ) (r) 
having a similar meaning. Further, 


V( P r Pr «) <m |//(p,, p,, ft))|e ,,1,p i-P)-“ ,) 
Tr W(p r ps, to) 
yjS( p,, to) 


( 2 ) 


is the spectral degree of coherence of the electric field at the pinholes [Eq. (8b) of 
Section 9.2]. 

Suppose that polarizers II, and n 2 , which transmit only the x components of the incident 
electric field, are placed in front of the pinholes. The cross-spectral density matrix, 
[Wp!, p 2 , ft>)] + say, of the light emerging from the pinholes is 


[W(p r p 2 . w)] + 


<£*(p,. w)£ v (ps, &>)) 0 

0 ~ 0 


(3) 


Clearly 

[Tr W(p|. p 2 , w)] + = lT u .(pi, p 2 . a), (4) 

i.e. the trace of the cross-spectral density matrix of the transmitted light is the leading- 
diagonal element of the cross-spectral density matrix W of the light incident on the 
pinholes. Using Eq. (4), Eq. (2) gives 

W *A ( Pr P2’ w ) = V S v ( Pi' sl S JP2’ w ) ? /v.v ( Pi‘ Pi' 

where the subscripts on the quantities on the right indicate that the values of the spectra and 
of the spectral degree of coherence pertain to this experimental set-up. i.e. when only the x 
component is transmitted. 

In a similar way, if polarizers which transmit only the _v component of the incident beam 
are placed in front of the pinholes, one obtains in place of Eq. (5) the analogous formula 

W yCpi, p,. ft)) = ^S,,(Pj, to) ^S v (p„ ft)) //,, (p,. p 2 , to). (6) 

An expression for the off-diagonal elements of the matrix W(p b p 2 , w) may be obtained 
as follows: linear polarizers are again placed in front of the pinholes. The polarizer at 
GiCpj) transmits the x component and the polarizer at Q 2 ( p 2 ) transmits the v component of 
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the incident light. In addition a rotator is placed behind the polarizer at the pinhole Q 2 (p 2 ). 
which rotates the transmitted field component through 90° in the clockwise sense about 
the beam axis [the (positive) z direction]. The cross-spectral density matrix of the light 
emerging from the pinholes now is 

[Wipj, p 2 , w)r = 

Hence, with this arrangement, 

[Tr W(p|. p 2 , 6))]~ = W xy ( P] , p : , to). (8) 

Equation (2) now gives, in obvious notation, 

W xy (p r P 2 , oi) = yjs^X p,, to) w)^,(p,, P ; . cu). (9) 

With a strictly analogous arrangement one can obtain the other off-diagonal element of 
the W matrix in the form 

(P r Pi’ 0J > = ^5 ? (p r <u) <JS X ( p 2 , to) // vv (p,, p 2 , w). (10) 

The spectra ■S\(pj, to) and .S\(p 2 . to) may be measured with the help of the usual spectro¬ 
scopic devices and, as we noted earlier (see the discussion following Eq. (13) in Section 
9.2), the spectral degree of coherence (p,, p 2 . w). Q,j = x. y), may be determined from 
interference experiments. Thus, we have shown how all four elements of the 2 X 2 cross- 
spectral density matrix W(p,, p 2 . w) may be measured. 

9.4 Changes in random electromagnetic beams on propagation 

W'e showed in Section 4.2 that the spectrum of light may change on propagation. Such 
changes may be said to be induced by the coherence properties of the source. The analysis 
in Section 4.2 was based on scalar theory. One might expect that spectral changes will also 
be generated when the electromagnetic nature of the light is taken into account. One might also 
expect that there will be other changes on propagation, for example, in the degree of polar¬ 
ization of the beam and in its state of polarization, i.e. changes in the size, the shape and the 
orientation of the polarization ellipse of the polarized portion of the beam. In this section 
we will consider such changes which may also be said to be correlation-induced. A treat¬ 
ment using the cross-spectral density matrix , which we will sometimes just call the corre¬ 
lation matrix for short, provides a unified approach to determining changes of this kind. To 
elucidate them we will first determine how the matrix changes as the beam propagates. 

9.4.1 Propagation of the cross-spectral density matrix of a stochastic 
electromagnetic beam - general formulas 

Let {E' 0) (p', tu)} and {E(r, w)} be the statistical ensembles, introduced in Section 4.1, of the 
spectral components at frequency w of the fluctuating electric field vectors at a point Q( p' ) 
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in the source plane z = 0 and at the point P{ r) = P( p, z.) in the half-space z > 0, respec¬ 
tively (see Fig. 9.2). Then [see M&W. Eqs. (5.6-14) and (5.6-17)] 

EXr, at) = e fc f EfH p', w)G(p - p'. « w)d 2 p', (1) 

J J (z— 0 ) J 


where j = .v ary represents the components of the electric field vector in two mutually orthog¬ 
onal directions, perpendicular to the axis of the beam (the z direction). Further, 


G(p - p', z; no) = - exp 
27rz 


U(p - p ) 


>\2 


2 z 


( 2 ) 


(k = m/c. c being the speed of light in vacuum) is Green's function for paraxial propagation 
from the source point Q(p') to the field point P {r = p, z). 

On substituting for Ej from Eq. (1) into Eq (3) of Section 9.1 one obtains the following 
expression for the spectral correlation matrix W at a pair of points located in any transverse 
plane z = constant > 0 in terms of the 2 X 2 correlation matrix W (0) of the electric field 
vector at pairs of points in the source plane z = 0: 

W(pp ptj.z;a>) = JJ =q W ( °)(p,.p,',w')/f(p l - p;,p, - p^.z; w)d 2 p | , d 2 p', (3) 

where 

K(p\ ~ Pp PS ~ P r z; co) = G (p, - pj, z; w)G(p, - p^, z; at). (4) 

Equation (3) applies to propagation in free space. It is not difficult to generalize it to prop¬ 
agation in any linear medium whether deterministic or random. 1 




Fig. 9.2 Notation relating to the propagation of an electromagnetic beam. 


1 Some applications of the generalized formulas have been used in studies of propagation of random electromag¬ 
netic beams in the turbulent atmosphere. See, for example, H. Roychowdhury, S. A. Ponomarenko and E. Wolf, 
J. Mod. Opt. 52 (2005), 1611—1618; M. Salem, O. Korotkova, A. Dogariu and E. Wolf, Waves in Random Media 
14 (2004), 513-523; and O. Korotkova, M. Salem, A. Dogariu and E. Wolf, Waves in Random and Complex 
Media 15 (2005), 353-364. 
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9.4.2 Propagation of the cross-spectral density matrix of an 
electromagnetic Gaussian Schell-model beam 

We will illustrate the usefulness of the preceding analysis by a number of examples. We will 
restrict ourselves to correlation-induced changes in so-called electromagnetic Gaussian 
Schell-model beams. These are model beams which are generalizations of the scalar 
Gaussian Schell-model beams that we encountered in Section 5.3. The elements of the cor¬ 
relation matrix of planar sources which generate such beams are 


Hfv;, p(, co) = y[sfH p[. m)pf( pi - P ;. co), 

U = -v. y; j = X, V). 


(5) 


Here pj and pi are two-dimensional position vectors of points in the source plane z = 0, 
Sj ()) (pj, to) and Sj 0) (pi, to) denote the spectral densities of the i and the j components, 
respectively, of the electric field vector in the source plane 4 = 0, and p'" (pi - pj. co) 
denotes the degree of correlation between the components E, at pj and E. at pi. Both these 
quantities are given by Gaussian functions, viz., 

5 (l), (p . 10 ) = At exp[—p' a /(2a ; 2 )], (6a) 

4 ° ,( -P2 ~ Pp Lo) = B ij ex PKf| “ Pj [fK2Spl (i = x. y: j = x.y). (6b) 


The parameters A ; , By, cr, and 6y are independent of position but may depend on the fre¬ 
quency to. However, they cannot be chosen quite arbitrarily. In particular 

By = 1 when i = j, (7a) 

Ifi.J £ 1 when j sc i. (7b) 

and 


B = B . 


V J' 


(7c) 


where the asterisk denotes the complex conjugate. Further. 

bp = by- (7d) 

The constraint (7a) follows at once from the fact that, when j = i. pf\ pi - pj, a>) is just 
the usual correlation coefficient which necessarily has the value unity when pi - 
pj =0. The inequality (7b) follows from the fact that the correlation coefficient cannot 
exceed unity in absolute value. 1 The relations (7c) and (7d) follows from the fact that 


See Appendix A in O. Korotkova, M. Salem and E Wolf, Opt. Common. 233 (2004), 225-230. 
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Wy* 1 ], r 2 , o>) = W*i( r 2 , r h to), which is an immediate consequence of the definition of the 
correlation matrix W, The variances 6jj and the coefficients B : have to satisfy some addi¬ 
tional constraints, which are consequences of the fact that the correlation matrix is neces¬ 
sarily non-negative definite. Such constraints have been discussed in several papers. 1 In 
addition there are also constraints that have to be satisfied in order that the source generates 
a beam. If 


the constraints are 2 


cr x = cr Y = a 


( 8 ) 


-L + A*^, 

4cr 2 X 2 


(i = x,y\ j = x, y). 


(9) 


It is shown in Appendix III that when the condition (8) is satisfied, i.e. when the r.m.s. 
widths of the spectral densities S ( ”'(p', w) and S ( , 0) (p , to) of the .v and y components of the 
electric field at each point in the source plane ; = 0 are equal, the spectral degree of polar¬ 
ization 2*-*(p, to) is independent of p, i.e. it has the same value at every point of the source 
plane. 

Let us now consider an electromagnetic Gaussian Schell-model beam, i.e. a beam gen¬ 
erated by a Gaussian Schell-model source. 1 Its cross-spectral density matrix W(pj, p^, z\ to) 
may be calculated by substituting from Eqs. (5) and (6) into the propagation law [Eq. (3)]. 
with Green's function G(p - p\ z: to) in the propagation kernel Alp, — pj, p 2 — p 2 ; z, tu), 
defined by Eq. (4), being given by the paraxial approximation (2). One finds after a straight¬ 
forward but long calculation that, assuming that the constraint given in Eq. (9) applies, 


W.(p,, p 2 ; z, w) 


AA B 
l j u 

A 2 (z) 


exp 


(Pi + p 2 ) 2 

%cr 2 A 2 (z) 


exp 


(P : ~ Pij~ 

2Ct 2 A 2 (z) 



ik(& - p;) 

2R..(z) 
ij' 1 


( 10 ) 


where the quantities A|(z), sometimes called the beam-expansion coefficients, are given by 
the expressions 


A 2 (z) 


x’ 

Z 

kail- 

'J 



1 J_ 

4tr 2 + S 2 . 
ij 


(11a) 


1 See F. Gori. M. Santarsiero. G. Piquero, R. Borghi. A. Mondello and R. Simon. J. PttreAppl. Opt. 3 (2001), 1-9 
and H. Roychowdhury and O. Korotkova, Opt. Common. 249 (2005), 379-385 which provides a somewhat 
more general treatment. 

2 O. Korotkova, M. Salem and E. Wolf. Opt. Lett. 29 (2004), 1173-1 175. 

2 Methods for generating Gaussian Schell-model sources and Gaussian Schell-model beams have been described 
by G. Piquero, F. Gori. P. Roumanini, M. Santarsiero, R. Borghi and A. Mondello in Opt. Common . 208 (2002), 
9-16 and T. Shirai, O. Korotkova and E. Wolf, J. Opt. A: Pure Appl. Opt. 7 (2005). 232-237. 
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and 


R (-) 

IJ v ' 



(lib) 


Of special interest is the case in which the field points are in the far zone. One then finds 
that, provided that Eq. (10) holds, the elements of the cross-spectral density matrix of the far 
field (denoted by superscript oc) are given by the formula 1 


W. (r.s.. r 1 s 1 , to) = k 2 cos6. cos 9, 
y 1 - (a 2 . - b 2 ) 

' n n ' 


A. A B 

J u exp|~2fc 2 [a- 


/?/,(s, ’ s 2 )]} 


X 


exp[iA:(r 2 - r } )] 

Vi 


U = v, y; j = x, y). 


( 12 ) 


as kr | —> oo and kr 2 —> co , with the directions, specified by the unit vectors s, and s 2 . being kept 
fixed (see Fig. 9.3) and with 


where 


a 

i.i 


IJ 4 (a 2 . - b 2 ) 

ij ij ' 


b . 

= -2-, 

,J 4 (a 2 . - b 2 .) 
v v >j J 


]_ 

2 




03a) 


(13b) 



Fig. 9.3 Notation relating to the arguments of the correlation matrix at points in the far 
field of an electromagnetic Gaussian Schell-model source. 


O. Korotkova, M. Salem and E. Wolf. Opt. Lett. 29 (2004). 1173-1175. 














186 


A unified theory of polarization and coherence 


9.4.3 Examples of correlation-induced changes in stochastic 
electromagnetic beams on propagation 

Equation (10) gives the elements of the cross-spectral density matrix at pairs of points in 
any cross-section z = constant > 0 of electromagnetic Gaussian Schell-model beams of a 
wide class, namely those for which the constraint given by Eq. (8), i.e. a x = cr y = a, holds. 
As we have already noted, the constraint ensures that the degree of polarization is the same 
at every point in the source plane z = 0. 

On substituting for the matrix elements given by Eq. (10) in Eqs. (2) and (14) of Section 
9.2 one can determine the changes in the spectral density S(r, w) and in the spectral degree 
of polarization lP(r, a>) throughout the half-space z> 0 into which the beam propagates. 
Using Eq. (10) one may also determine changes in the complete state of polarization of the 
beam at each point in that half-space, i.e. the shape and the orientation of the polarization 
ellipse of the polarized portion of the beam. We will discuss such changes shortly. 

For simplicity we will assume that the x and y components of the electric field in the 
source plane are uncorrelated, i.e. that 

Rj' “ Pp ") = P2 ~ Pr 0>) = °- ( 14 ) 

According to Eq. (6b) this implies that 

B X y = B yx = 0. (15) 

and it is clear from Eq. (5) that, in this case, 

WW(Pp P4 OJ) = wW(p[, p'- o» = 0, (16) 


i.e. the correlation matrix is now diagonal. 

According to Eqs. (10) and (7a), the elements of the cross-spectral density matrix of the 
field in any cross-section perpendicular to the beam axis are given by the expressions 


W,(P r P 2 - ?, oj ) 



exp 


pf + p? 

4cr 2 A^(") 


exp 


(R: P ,) 2 

2 6 2 A 2 (r) 

XV XV v ' 


exp 


iA(p| - p 2 ) 


(17a) 


w vv (p,, p 2 . r, w) 


A 2 

V 

Pf + P’’ 


(p. - p,) 2 


i^(p 2 - Pf) 



exp 


exp 


A 2 U) eXP 

yy v / 

4cr 2 A 2 v (z) 

2^A^(s) 

2® w (z) 


(17b) 


w„(p,, p 2 , z; w) = W VA .(P], p 2 , z; w) - 0. 


(17c) 
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In the formulas (17a) and (17b) the beam expansion coefficients are given by the formulas 


= 1 + 


{kerf 


1 , 

4 ( 7 “ + 8 2 _ 


z~. 


(18a) 


A?,fr) = l + 


(ka) 2 


4a 2 8 2 


(18b) 


and, according to Eq. (lib). 




1 + 


A 2 (“) 


(19a) 


$ (z) 

VV 


1 + 


A 2 (z) 

VV ' ' 


(19b) 


Let us now make use of these expressions to elucidate, by means of a few examples, correlation- 
induced changes in an electromagnetic Gaussian Schell-model beam on propagation. 

We have from Eqs (2b) and (17) that 


S(r. m) = Tr W(p, p, a>) = lV x (p. p, a>) + W vv (p. p, 


A 2 

P 2 

A 2 

, v 

i 

p- 

A 2 „U) "‘ P 

2(t 2 A 2 y(s) 

A 2 /;) CXP 

2a 2 A 2 n .(z) 


( 20 ) 


Since in the present case the correlation matrix is diagonal, we obtain, on using Eqs. (17), 
the following expressions for the determinant: 


Det W(p. p, r> cd) = W (p. p, z; to) W (p, p, z: to) 


A 2 A 2 


1 1 

p 2 

A 2 v( ;)A 2 v (z) CXP 


A 2 (z) ' A 2 (z) 

xx v 7 yy v W J 

2a 2 


( 21 ) 


On substituting from Eqs. (20) and (21) into Eq. (14) of Section 9.2, we obtain an expres¬ 
sion for the degree of polarization at any point in the beam. A few examples of such changes 
in planes | = constant > 0 are also shown in Fig. 9.4. The behavior of the spectral degree 
of polarization along the axis is shown in Fig. 9.5. The considerable differences between 
these figures reveal that the changes in the degree of polarization on propagation depend 
very sensitively on the values of the parameters that specify the source. 
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Fig. 9.4 The normalized intensity (spectral density) S ;V (p. t; to) and the spectral degree 
of polarization tP(p. to) of a stochastic electromagnetic beam propagating into the 
half-space z > 0. The field in the source plane z = 0 is represented by a cross-spectral 
density matrix with elements 


W t (p'. p 2 '. to) = F(p{, p,', to)cos 2 6. 


W. y (p. p,', to) = F(p[, p,'. to)exp 


cp.' - p'r 


2d 2 


sin-p. 


wjp;, p,’, w) = w ' (p,', p,'. up = o. 


with 


Ftp p!, to) = S 0 (to)exp 


i»; 

4tT 2 


The source generates a linearly polarized, spatially coherent Gaussian electromagnetic 
beam, with polarization angle 0 and with the effective width <7 S , incident on a random 
phase screen. The parameter 6 yy is a constant relating to the properties of the screen. The 
values of the parameters were taken as o s = 1 mm. 6 yy = 0.1 mm and A = 632.8 nm. The 
distance of propagation ? is normalized by the Rayleigh range : R = 2 ka] )k = to/c). 
[Reproduced from T. Shirai and E. Wolf. J. Opt. Soc. Amei: All (2004). 1907-1916.] 
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Fig. 9.5 The spectral degree of polarization fP(0, z, to) of a stochastic electromagnetic 
beam, with the same parameters as in Fig. 9.4, along the axis of the beam (the c-axis), for 
some selected values of the polarization angle 0. The propagation distance is normal¬ 
ized by the Rayleigh range - R = 2ka 2 , of the incident beam. [Reproduced from T. Shirai 
and E. Wolf. J. Opt. Soc. Amer. A21 (2004), 1907-1916.] 


It may seem surprising that, in general, the degree of polarization changes as the beam 
propagates. The reason for such changes is the difference between the correlation coeffi¬ 
cients which specify the correlation between the x and the y components of the electric field 
vector at the two source points. Unless the correlation coefficients are the same, the expansion 
coefficients A^/z) and Aj^z), defined by Eqs. (18), have different values. Consequently 
the diagonal elements W xx and W vy of the correlation matrix, given by Eqs. (17), expand 
at different rates. This leads to different behaviors of the two matrix elements, resulting in 
changes in the degree of polarization as the beam propagates. 

One might expect that not only does the degree of polarization, in general, change on prop¬ 
agation but also that the shape and the orientation of the polarization ellipse of the polarized 
portion of the beam change. This indeed is the case. 1 Because the derivation of the pertinent 
formulas is rather lengthy, we will only state them here. 

One finds that, in terms of the elements of the correlation matrix, the squares of the 
major and the minor semi-axes of the polarization ellipse are given by the formulas 


« 2 (p, z; w) = - 

O 


b 2 ( p, z; to) = 


- W,v) 2 + J(W XX - tty 2 + 4 [%eWJ 2 

- WJ 2 + 4|w n .| 2 - J(W n - WJ 2 + 4 IZeWJ 2 


. (22a) 


(22b) 


where denotes the real parts. 


See O. Korotkova and E. Wolf, Opt. Comnun. 246 (2005), 35-43. 
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The angle 0 {) which the major axis of the polarization ellipse makes with the .r-axis is 
found to be given by the formula 


BAp, z\ u>) = — aictan 


2%lT (V (p, r, (’>) 

WJv, z; U>) - Wjp, z: CO) 


{-tt/2 < e n < tt/2). (23) 


An example of changes in the polarization ellipse of an electromagnetic Gaussian Schell- 
model beam, calculated from these formulas, is given in Fig, 9.6. The corresponding val¬ 
ues of the degree of polarization are also indicated in that figure. 



Fig. 9.6 Changes in the polarization ellipse and the degree of polarization associated 
with atypical Gaussian Schell-model beam on propagation in free space. [Reproduced 
from O. Korotkova and E. Wolf. Opt. Coimmtn. 246 (2005), 35—13.] 
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9.4.4 Coherence-induced changes of the degree of polarization 
in Young’s interference experiment 1 

In the previous section we studied correlation-induced changes in the polarization proper¬ 
ties of a Gaussian Schell-model beam on propagation in free space. Somewhat similar changes 
may take place when two correlated beams are superposed, for example, in a Young inter¬ 
ference experiment. In this section we will briefly discuss the change in the degree of polar¬ 
ization in Young's interference pattern as the degree of coherence of the light incident on the 
pinholes is changed. 

Consider a stochastic, statistically stationary electromagnetic beam propagating close to 
the 2 -axis and suppose that an opaque screen 24 is placed across the plane 2 = 0 with small 
openings at points Q\(pf and Q4 p 2 ) and that measurements are made at a point Pi r) in the 
observation plane ®, which is parallel to the screen J4in the half-space 2 > 0 (see Fig. 9.1). 

Let {E(p b «)} and {E(p 2 , «)} be the statistical ensembles of the electric field vectors at 
the two pinholes. If {E(r. «)} denotes the statistical ensemble of the electric field vector at 
the point Pir) then we have for each member of this ensemble, as in Eq. (1) of Section 9.2, 

E(r. w) = A^Elp,, w)e iM > + tf,E(p 2 , co)e m , (24) 

where K, and K 2 are the same geometrical factors as we encountered before [Eq. (3) of 
Section 3. 1 (with A now replaced by A)]. We again assume that the angles of incidence and 
of diffraction at each pinhole are small. 

To determine the degree of polarization at a point P(r) in the observation plane 'B we 
must first determine the elements of the correlation matrix 

W(r, r, a>) = [W.(r, r, «)] = [(£*(r, «)£.(r, to))]. (i = .r. y\j = v,.v). (25) 

On substituting from Eq. (24) into Eq. (25) we readily find that 

W y (r, r, m) = Wy( p, p. w) + |a: 2 |" W (p,, ps, to) 

+ K\K 2 W ..( P| , p,, co)c iM 2-V + K,k;w & ( p,, p, tn)e- ik(R ^K (26) 

where 

W.(p. p,, w) = (E j (p. w)Ej{ p 2 . w)), (i = x. y:j = x.y) (27) 

are the elements of the cross-spectral density matrix whose arguments are the points Q ( (p,) 
and Q4 p 2 ) where the pinholes are located. 

Suppose that the beam incident on the pinholes is an electromagnetic Gaussian Schell- 
model beam, which we discussed in Section 9.4.2. The spectral densities Sf\ p, a>) 


The analysis of this section is based on a paper by H. Roychowdhurv and E. Wolf. Opt. Commun. 252 (2005). 
268-274. 
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(j = a\ y), of the x and y components of the electric field incident on the pinholes are then 
given by Eq. (6a) and the correlation coefficients pj^fpj - p,. to) are given by Eq, (6b), 
For simplicity we will assume that the spectral densities of the x and the y components of 
the electric field vector at the pinholes are the same. Then in Eq. (6a) 


Aj. = A y = A. 


(28) 


In general A will depend on the frequency to. We will also assume that in Eq. (6b) for the 
correlation coefficients p™'(p 2 - pj, to). 


B = 

u 


1 

B {) (real) 


if i = J- 
if J * i. 


(29a) 

(29b) 


and that 

4v = 6 „ = 6 . 


(30) 


The cross-spectral density matrix of the electric field at the pinholes may then readily be 
shown to be given by the formula 


W- 0j (pp p,. to) 


A 2 exp 


exp 

((T “ P,) 2 

B q exp 

(p. - p|) 2 ] 

2 8 2 

2b 1 

XX 

B 0 exp 

(Pj - Pi ) 2 

K 

exp 

(p 2 - Pi ') 2 

2< ^ 


(31) 


where we have used the relation 8 yx - 6 xy [Eq. (7d) of Section 9.4.2], On substituting from 
Eq. (31) into Eq. (8a) of Section 9.2 one finds that the spectral degree of coherence of the 
electric field at the tw'o pinholes is given by the expression 


t; (0) (p, p, to) = exp 


(P; ~ P, ) 2 

2S 1 


(32) 


If the pinholes are placed symmetrically w'ith respect to the z axis, then p 2 = — p, and the 
expression (32) for the spectral degree of coherence becomes 


?/ 0, (p,. -pj, to) = exp 



(33) 


An expression for the degree of polarization at each pinhole is readily obtained on sub¬ 
stituting from Eq. (31) into the general expression (14) of Section 9.2. One finds that 


!P (0) (p„. to) = B 0 (a =l,2). 


(34) 

























9.4 Changes in random electromagnetic beams 


193 


Since this formula shows that the degree of polarization has the same value. S„. irrespec¬ 
tive of the location of the pinholes, we will suppress the dependence of !P ,0) on p (t and will 
write 

!P ,0, (p a , w) = 2> 0 (35) 

In order to determine the spectral degree of polarization ‘P( r. to) at a point in the fringe pat¬ 
tern, we need to determine the elements of the matrix W at a point P( r) in that plane. On 
substituting from Eq. (31) into Eq. (26), we find that, with the symmetric location of the 
two pinholes, i.e. with p 2 = —p|. 


W..(r, r. a>) = 


A 2 exp 


A 2 exp 


Pf_ 

2 cr 2 


AT, 1“ + \K, + 2^(A’ 1 ^e iA,fl ^ s i ) )exp 


when i = y, 

2 

_ I Y 

I 


Pf 

2a 2 


/C,f + \K 2 [ + 2 Pe( K* K ^e' k( R ~ )e\p 
when i + j. 


s 2 


2 3L 

s 2 . 

ij 


It is convenient to set 


Sj( r, to) = \Kj\ 2 A 2 exp 


2(7 2 


u = 1 2 ). 


(36a) 


fp 


(36b) 


(37) 


which represents the spectral density at frequency io of the held at the point r in the plane of 
observation. The formulas (36) may then be rewritten in the form 


IT.(r. r, to) 

S ,(r to) + S,(r, to) + 2 2 js i (r. to)^S 1 (r w)cos [k{IL, - R f )]exp 



when i = j , 

5,(r, w) + S,(r, to) + 2^5, (r, to)yjS-,( r. w)cos [k(R 2 
when / + j. 


R } )]exp 



(38a) 

fP 

(38b) 


On substituting from Eqs. (38 ) into Eq. (14) of Section 9.2 we obtain the following expres¬ 
sion for the degree of polarization in the plane Sof the interference pattern: 

T(r, to) 



S 2 

■y 

Sj(r, «) + S 2 (r. w) + 2^5,(r, to)^ 

! S n ( r. to)cos[k{R 2 — K^exp 


2fV 

8 2 

5,(r, co) + §l,(r, co) A 2^,(r, co )^ 

^S 2 (r, to)cos[k(R^ — /?| )]exp 


(39) 
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According to Eq. (33) the exponential factor in the last term in the denominator is just the 
spectral degree of coherence f/ 0l (pi, — pj, lo) of the light at the two pinholes. Hence Eq. (39) 
may be rewritten in the form 


T{r, oj) 

Sj(r. oj) + S-,(r, oj ) + lyjSfir. oj)^S 2 )r. to)cos[k(R^ - /?,)]exp 
S^r, to) + S,(r, oj) + 2^5 1 (r. oj)yjs x ( r, a>)cos[A(7?-, - R } )]r/ 0 '(p,. 


2p 2 


-p, OJ) 


% 


O' 


(40) 


Frequently the spectral densities 5,(r. oj) and S 2 ( r, oj) of the light reaching the point P(r) 
in the plane (Swill be approximately the same. i.e. .S 2 (r, o>) ~ S |(r, oj). Equation (40) then 
reduces to 


(P(r, oj) 


1 + cos \k(R-, — )jexp 
T + cos[k(R 2 - 



p. oj) 


-L r 


O' 


(41) 


This formula shows that the degree of polarization in the detection plane (3 depends not 
only on the degree of polarization (P 0 of the light at the pinholes but also on the spectral 
degree of coherence tf n) of the electric field at the pinholes and on the parameter 8 xy which 
characterizes the correlation between thex component of the electric field at one of the pin¬ 
holes and the y component of the electric field at the other pinhole. This result brings into 
evidence the subtle relations which exist between polarization and coherence of stochastic 
electromagnetic beams. 

Figure 9.7 shows the behavior of the spectral degree of polarization (Pat the axial point 1 
(i.e. on the beam axis) in the plane of observation 'B, when the spectral degree of coherence 
?/ 0, (p], — p|, to) at the pinholes is varied while the degree of polarization (P 0 at each pinhole 
is kept constant. 

The effect of the degree of coherence of light at the pinholes on the degree of polariza¬ 
tion in Young's interference experiment was tested and confirmed experimentally. 2 


9.5 Generalized Stokes parameters 3 

In Section 8.1 we introduced the Stokes parameters of a random electromagnetic beam. 
They were defined in terms of the mutual intensity matrix J by Eq. (14a) of that section, i.e. 


1 The behavior of the spectral degree of polarization at off-axis points is discussed in Y. Li, H. Lee and E. Wolf. 
Opt. Commun. 265 (2006). 63-72. 

2 F. Gori, M. Santarsiero. R. Borghi and E. Wolf. Opt. Lett. 31 (2006). 688-690. 

The results presented iri this section are based on a paper by O. Korotkova and E. Wolf. Opt. Lett 30 (2005), 
198-200. 
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?(r, a>) (in the observation plane 8) 

Fig. 9.7 Behavior of the spectral degree of polarization T( r. of), at the axial point in the 
plane of observation 8, when the spectral degree of coherence // l0l (p 1 , —p 1( to) of the 
light at each pinhole is changed, while the degree of polarization 2> 0 at each pinhole is 
kept fixed. The curves are associated with different values of the parameter f rv which 
characterizes the correlation between the a component of the electric field at one pin¬ 
hole and the y component of the electric field at the other pinhole. The values of the 
parameters were taken as 8 {) = 0.5. * = 10cm, /?, = 2 mm, 


(a) = 5 . 



(b) 6 = - 

3 + -fL 

= 1.1(5, 

4 



fe) <5 = — 

3 + —!= 

= 1.25, 

- rv 2 

A 


(d) 8 = - 

>+-* ’ 

= 1.35. 

■» 4 

vv 


(e) <5 = —^ 

xy r 

= = 1,4<5 



[After H. Roychowdhurv and E. Wolf. Opt. Commun. 252 (2005). 268-274.] 
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in terms of equal-time correlations between mutually orthogonal components of the elec¬ 
tric field at one point. It is also possible and useful to define analogous quantities in the 
space-frequency domain by somewhat similar formulas, in terms of the cross-spectral den¬ 
sity matrix, viz.. 


s 0 (r. co) = W tA .(r, r, co) + W vv (r, r, co), 

,Vj(r, co) = W Or, r, co) — W (r, r, co), 

s,(r, co) = W' (r, r, co) + W yt (r, r, co), 


(la) 



or. more explicitly, in terms of the spectral components of the electric field, 


,v () (r, co) = {£*{ r, io)EJr, co)) + (E*( r, co)E y ( r, co)}, 

>Vj(r, co) = (£*(r, a>)E x ( r, w)} - £ v tr. w)E y (r, co)), 

s 2 (r, co) = (E* x ( r, co)E y (r, w))+(£*(r, w)E x (r, w)), 

s 3 (r, co) = i[(£*(r, co)EJr. co)) - (E*( r, co)EJ r, co)) 


(lb) 


These four parameters may be called the spectral Stokes parameters. They can be deter¬ 
mined experimentally in a similar way to how one determines the usual Stokes parameters, 
provided that the light is filtered to become quasi-monochromatic around the frequency co. 

In Section 9.4.1 we showed how' the cross-spectral density matrix of a stochastic electro¬ 
magnetic beam changes on propagation. In this section we will briefly consider how' the 
spectral Stokes parameters change as the beam propagates. For this purpose it is, however, 
necessary to generalize the usual Stokes parameters, which depend on one point, to quan¬ 
tities w'hich depend on two points. Such generalized Stokes parameters may be introduced 
by formulas of the form given by Eq. (1), but with the two equal spatial arguments (r, r) on 
the right-hand side replaced by two unequal spatial arguments (iq, r 2 ). Thus, in place of the 
formulas (la) we now have 



(2a) 


More explicitly. 



(2b) 
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If the beam propagates in free space, the cross-spectral density matrix W(r,, r 2 , to) prop¬ 
agates from an initial plane z = 0 according to Eq. (3) of Section 9.4.1. It follows at once 
from Eqs. (2a) for the generalized spectral Stokes parameters that these parameters propa¬ 
gate according to the same law, viz., 

S a (r r r 2 ,co) 

= ff, =ll s a )( Pi’' Pr W ^(P| - Pi'’ ft - Py z; w)d 2 p[ d 2 Py (a = 0. 1,2, 3), 

(3) 

where r, = (p ]a z); r 2 = (p 2 , z) and the propagator /f(p, - pj, p 2 - p 2 , z; w) is given by 
Eqs. (4) and (2) of Section 9.4.1. 

If we set r, = r 2 = r in Eq. (3) the formula becomes, with r = (p, z), 

.rjr. to) = SJr, r. w) = Py >»)K( p - p - p 2 , z; w)d z p\ d 2 p'. (4) 

This formula expresses the ordinary spectral Stokes parameters s a (r, w) at any point in the 
half-space z > 0 in terms of the generalized spectral Stokes parameters S a at all pairs of 
points in the plane z = 0. Evidently knowledge of the ordinary spectral Stokes parameters 
sjr, w) at all points in the source plane is not adequate to determine the values of those 
parameters throughout the half-space z > 0. In fact two stochastic electromagnetic beams 
which have the same Stokes parameters sjfi, w) in the source plane z = 0 may produce 
beams of different degrees of polarization throughout the half-space z > 0. 1 

It is clear that the generalized spectral Stokes parameters, just like the cross-spectral den¬ 
sity matrix, can be used to elucidate the spectral properties, the polarization properties and 
the coherence properties of a stochastic electromagnetic beam. 

In Figure 9.8 examples are given of the changes of the spectral Stokes parameters of an 
electromagnetic Gaussian Schell-model beam on propagation in free space, calculated 
from Eqs. (4) and (2a). 

Similar calculations can be carried out for propagation of the spectral Stokes parameters 
not only in free space but in any linear medium, deterministic or random. One must then 
use in Eq. (4) the kernel appropriate to propagation in that medium. 


PROBLEMS 

9.1 Two unpolarized, mutually uncorrelated stochastic, statistically stationary beams prop¬ 
agate close to the z axis into the half-space z > 0. {E ,A, (r, «)} and {E (Bj (r. w)} are the 
statistical ensembles representing each beam. Further, 

Wfj A> = (E\ h) *E {A) ), = {£' B) *£ , '. B) ) 


An example of this kind is given in M. Salem. O. Korotkova and E. Wolf. Opt. Lett. 31 (2006). 3025-3027, where 
tiie difference in the degrees of polarization of the two beams is due to the difference in the degrees of coherence 
of the field in the source plane * = 0. information which is not contained in the ordinary Stokes parameters sj. 
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i 



0 


0 


10 ° 10 1 10 2 10 3 
Distance of propagation z (m) 


Fig. 9.8 Changes of the Stokes parameters sq, jr x . s 2 and of an electromagnetic Gaussian 
Schell-model beam on propagation in free space. The parameters characterizing the source 
have been taken as ai = 3 X 10 15 FIz (A = 6328nm|, = 1.5, A y = 1, 6 = argB n . = it/6. 

|S^| = 0.35. a = 1 cm, 6 ry = 0.2 mm, 6 XV = 0.25 mm and = 0.15 mm. [Reproduced 
front O. Korotkova and E. Wolf, Opt. Lett. 30 (2005), 198-200. J 


and 



x and y being mutually orthogonal directions perpendicular to the z direction. The angu¬ 
lar brackets denote the ensemble average. 

(a) Derive an expression for the cross-spectral density matrix of the total field. 

(b) Show that the total field may be partially polarized. Under what condition will the 
total field be unpolarized? 

9.2 A thermal light beam, with axis along the z direction, is incident on two detectors 
located at points PdPi) and P 2 ( P 2 ) in a plane z = constant. Assuming that the 2X2 
cross-spectral density of the beam is symmetric in the variables pj and p 2 , show that 
the correlation between the intensity fluctuations A/(p ( , w) = I(p p to) — (7(p p to)) 
(j = 1, 2) is given by the expression 


(Afrp,, w)A/(p 2 , to)) = Tr[W (p b p 2 , m)W(p,, p 2 , «)], 


where W(pj, p 2 , to) is the cross-spectral density matrix of the beam, and the dagger 
denotes the Hermitian adjoint. 


9.3 Derive an expression for the degree of coherence of a planar, secondary, electro¬ 
magnetic Gaussian Schell-model source. 
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9.4 Show that the degree of coherence at any pair of points in a cross-section of an electro¬ 
magnetic Gaussian Schell-model beam tends to unity with increasing distance of prop¬ 
agation in free space. 

9.5 Consider a beam generated by a Gaussian Schell-model source located in the plane 
z = 0 and linearly polarized along the x direction, with spectral densities 


SfHco) = 


and correlation coefficients of the electric field 



/'.vy( P,- P : - w > = 0. 


In these expressions A , B, a, S x and S y are independent of position and of frequency and 
the source is assumed to be large relative to a. Let 


S( r. to) 


s(r, to) = 



be the normalize spectral density at the point r. Derive an expression for .sir, to) at points 
in the far zone. 

Plot the normalized source spectrum and the normalized far-zone spectrum in direc¬ 
tions making angles 9 = 0° and 6 = 0.3° with the normal to the source plane, when 
A = 1 . a = 0.2w and 6 = 0.5 mm. 

Determine and plot also the degree of coherence if 0) (p,, p 2 , to ) of the electric field 
in the source plane at pairs of points located symmetrically with respect to the beam 
axis, as a function of the separation |Ap| = 2|pj( = 2|p ; |. 

9.6 Consider a stochastic electromagnetic beam generated by a planar, secondary source, 
located in the plane z = 0 and propagating about the ’-axis into the half-space - > 0. 
The spectral densities Sp((o) and S! 0) (w) of two mutually orthogonal components of 
the electric field in the source plane are assumed to be independent of position. 

Show that the spectral density of the field at a point r in the half-space z > 0 may be 
expressed in the form 


Sir. oj) = SfHw)M x (r, to) + SfHto)M y ( r, to). 


Derive expressions for M x ( r. to) and M,( r, to). How do these expressions simplify 
when the point r is in the far zone? 
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Show that, if the components E x and E y of the electric field in the source plane are 
uncorrelated. the above formula can be expressed in the form 

S( r, co ) = 5J 0) (w)[M i (r, co) + a-(r, <a)M y ( r, to)]. 


where 


a(to) 


1 - P(°>(aj) 

1 + l pl°>(at) ’ 


&°Ka>) being the degree of polarization of the field in the source plane. 

9.7 &q denotes the unit matrix and er,, cr 2 and cr 3 are the so-called Pauli spin matrices. The 
four matrices are defined as 


I 

0 

, or, = 

i 

0 

, cr 0 = 

0 

l 

CT 3 = 

0 i 

0 

1 

i 

0 

-1 

’ 2 

l 

0 

-i 0 


(a) Show that 

Tr(ov • cr k ) = 2S jk (j, k = 0,1, 2, 3), 

where S jk is the Kronecker symbol (Sj k = 1 when k = j\ Sj k = 0 when k + j). 

(b) Show with the help of the above relations that the spectral Stokes parameters and 
the cross-spectral density matrix are related by the formulas 


Sj( r, w) = Tr{W(r, r, w) • 5 .} 


and 


W(r, r, <o) = iy A s j( r 9 

i=i 

9.8 .Sj(r. co) (j = 0, 1, 2, 3) are the spectral Stokes parameters of a quasi-monochromatic, 
stationary light beam. The matrix 


S(r, oi) 


V r - 

SjCr, w) 
i,(r, w) 
.s 3 ( r, to) 


is said to represent the (spectral) Stokes vector of the beam. 
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Suppose that the beam passes through a linear, non-image-fonning device. Let S'(r, to) 
denote the Stokes vector of the emerging beam. Then 

S'(r, to) = M(r, w)S(r, w), 

where M is a 4 X 4 matrix known as the (spectral) Mueller matrix. 

Express the elements of the Mueller matrix in terms of the elements of the instru¬ 
mental matrix, defined in Problem 8.6. 

9.9 Show that the two stochastic electromagnetic beams which propagate from the source 
plane z = 0 into the half-space z > 0 may have different degrees of polarization through¬ 
out the half-space, even though they have the same sets of Stokes parameters in the source 
plane. (Hint: use the generalized Stokes parameters to characterize the source.) 



Appendix I 


Cells of phase space and 
the degeneracy parameter 


(a) Cells of phase space of a quasi-monochromatic light wave (Section 1.4) 

We mentioned towards the end of Section 1.4 that the concept of coherence volume has a 
counterpart in the quantum theory of radiation, known as a cell of phase space. In this 
appendix we will define the cell of phase space and also introduce a related concept of 
degeneracy parameters of radiation. 

Let us first consider a monochromatic plane wave of wavelength A. According to one of 
the de Broglie relations (B&W, Appendix II), we may associate with it photons of momen¬ 
tum p, whose magnitude is 


where h is Planck's constant. Suppose that r = (x, y, z ) specifies the location of a photon. 1 
According to elementary quantum mechanics, the location and the momentum of a photon 
cannot be measured simultaneously with an accuracy greater than allowed by the 
Heisenberg uncertainty relation, viz., 2 

A.vA p x >h, AyAp y >h, A:Ap. > /;. (2) 

Let us introduce a six-dimensional space, called the phase space, in which points are 
specified by values of the six variables x, y, z, P x - P y , P : In view of the inequalities (2) it is 
natural to divide this space into regions of volume elements, called cells , of size 

A.r Ay Az A p x A ,p y A p, = h 3 . (3) 

Clearly photons of the same spin state (same polarization) belonging to a region of the phase 
space which is not greater than indicated by Eq. (3) are intrinsically indistinguishable. 

1 The position of a photon cannot be specified more accurately than to distances of the order of the wavelength 
but here we will ignore subtleties concerning photon localization. 

2 The uncertainty relations (2) are often written with a factor h/2 (h = hl{2ir)), rather than h on the right-hand 
sides. The value of the factor depends on the exact definitions of the uncertainties A.v, Ap„ etc. For our purposes 
definitions which imply the inequalities (2) are convenient. 
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We will now show that the concept of coherence volume which we introduced in Section 1.4 
from considerations based on classical wave theory is the volume Ac Ay Ac of ordinary space 
given by the formula (3), subject to the constraints imposed on the product Ay, Ay, A p- by 
the geometry and by the bandwidth of the light. Stated somewhat differently, we will show that 
the coherence volume is that region of space throughout which the photons are intrinsically 
indistinguishable. To justify this statement let us first estimate the uncertainty in the compo¬ 
nents of the momentum of a photon in the far zone of a field generated by a planar, quasi- 
monochromatic, thermal source <r, of linear dimensions 2a located in the plane z = 0 and 
radiating into the half-space z > 0. Let 2<f> denote the angle which the source subtends at a 
point Q in the far zone, assumed, for simplicity, to be small and situated on the normal to cr. at 
a distance R from it (Fig. 1). There will be uncertainties A p x and Ay, in the x and the y com¬ 
ponents of the momentum of the photon arriving at Q , because of the lack of knowledge 
regarding the source point from which the photon was emitted. Evidently these uncertainties 
are represented by the projections of the photon momentum p onto the .v and the y axes, i.e. 

-V. = Ay,. = 2 pep. (4) 

Using Eq. (1) and assuming <fi to be sufficiently small, one has 


A p x = A p y = 2 pf 



2 /? a 

T R ’ 


(5) 


where A is the mean wavelength of the emitted light. Since <p was assumed to be small, the 
uncertainty of the z component of the momentum arises mainly from the uncertainly in 
the wavelength. If AA is the effective wavelength range of the light, then it follows from 
Eq. (1) that 


Ay. = 



( 6 ) 



Fig. 1 Notation relating to the derivation of Eq. (5 ) of Appendix I. 
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Hence we have from Eqs. (5) and (6) 

Ap Ap Ap = — , 

' v ~ A 4 fl 2 


(7) 


where S = (2a) 2 is the order of magnitude of the size of the source. On substituting this 
expression into the expression (3) for a cell of phase space, we see that the volume of the 
space around the point Q throughout which the photons emitted by the source are intrinsi¬ 
cally indistinguishable is 


Ay A v A; 


lf_ 

S 


AX 


A 3 . 


( 8 ) 


Comparison of Eq. (8) with Eq. (2a) of Section 1.4 shows that the right-hand side is equal 
to the expression for the coherence volume derived by considerations based entirely on 
classical theory. Thus we have justified our earlier assertion about the quantum-mechanical 
significance of the coherence volume. 


(b) Cells of phase space of radiation in a cavity (Sections 7.4 and 7.5) 

The concept of cells of phase space was originally introduced not for a light beam but 
rather for thermal radiation in a thermally insulated cavity, i.e. for blackbody radiation. It 
plays an important role in the theory of photoelectric detection of light fluctuations, for 
example, which is discussed in Section 7.5. In this appendix we will derive an expression 
for the number of cells of phase space for this situation. 

Using Eq. (3) let us calculate the number Z, say, of cells of phase space associated with 
photons in some finite momentum range contained in a volume V of the cavity. Evidently 

7 = Txf d P.x d P y d A dv d >’ d ~' (9) 

where the integration extends over the accessible domain of the phase space containing the 
photons. The factor 2 in front of the integral arises from the fact that one must distinguish 
between two spin states. 

Consider photons of the same spin, whose energies lie in the range between 
E = hv and E + d£ = h(v + dv). 

The magnitude of the corresponding momenta will lie in the range between 

E hv , , h(v + dv) 

p = — = — and p + dp = -, 

c c c 

where c is the speed of light in vacuum. 
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Since we are considering radiation under equilibrium conditions, there is no preferential 
direction of propagation of the photons. This makes it possible to determine the contribu¬ 
tion to the integral in Eq. (9) from the momenta in a simple manner as follows. Consider a 
spherical shell in momentum space indicated on the figure below: 



The contribution to Eq. (9) from the shell is evidently 
f d p x dp, dp = 4irp 2 dp 


477 


h . 
— av 


hv 
c 

/j3 

4tt — v 2 Av. 
c 3 


( 10 ) 


The contribution from ordinary space (called also configuration space) is 

f dbcdydz = V. ( 11 ) 

On substituting from Eqs. (10) and (11) into Eq. (9) we obtain for the number Z of cells 
in phase space the expression 

2 /; 3 

Z = — 47t —— v 2 dv V, 
h 3 c 3 


t.e. 


7 SttT - 
Z = —— y- dn 


( 12 ) 


The size of the region in the cavity throughout which the photons are indistinguishable 
is, according to Eq. (12) with Z = I, given by the expression 


V = 


8tt 


AA 


A 3 , 


(13) 
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where we have used the relation v = c/A. Equation (13) is of the form of Eq. (2b) of 
Section 1.4 for the coherence volume of thermal light, with the solid angle MY replaced 
by 87 t. Now MY represents the solid angle formed by all the directions along w'hich radi¬ 
ation in the cavity can reach the region under consideration. As was mentioned in Section 
7.4, Einstein pointed out in a basic paper on energy fluctuations in blackbody radiation that 
radiation in a thermally insulated cavity may be regarded as a mixture of plane waves 
which propagate in all possible directions, filling a solid angle 4 tt. If we also take into 
account that the waves are unpolarized, i.e. that each consists of two mutually independent 
polarization states (e.g. left- and right-handed circularly polarized waves), one can at once 
understand the origin of the factor I/(8 tt) in Eq. (13). Thus we see that, just as in the case 
discussed in part (a) of this appendix. Eq. (13) derived from the concept of indistinguisha- 
bility of photons in a cell of phase space is in complete agreement with the expression for 
the coherence volume based on classical wave theory. 

Finally we might mention that the concept of cells of phase space is often encountered 
in the physics literature in various disguises. For example, one sometimes speaks of an 
elementary bundle of rays (a concept due to von Laue), which, just like the term “volume 
of an oscillation mode,” means radiation occupying a single cell of phase space. Sometimes 
one finds a reference to the degrees of freedom of a light beam or its Jean's number. These 
are merely alternative terms for the number of cells of phase space. 1 


(c) The degeneracy parameter 

For the analysis of some situations, encountered, for example, in the theory of photoelec¬ 
tric detection of light fluctuations (Section 7.5), it is important to have an estimate for the 
average number of photons contained in a cell of phase space. This quantity is known as the 
degeneracy parameter of the radiation. It differs drastically for thermal and for laser light, 
as we will now' show. 

For blackbody radiation at equilibrium temperature T, the value of the degeneracy 
parameter, 8, at frequency v, is given by the formula 2 


gfivl(k B T) _ | 


(14) 


k B being the Boltzman constant. Figure 2 indicates the values of the degeneracy parameter 
of blackbody radiation for various temperatures and wavelengths A = c/v, calculated from 
Eq. (14). For light of frequency v = 5 X 10 l4 Hz (A = 6 X 10 "'em) from an incandescent 
source at temperature T = 3,000 K, 

f~3X 10“ 4 , (15) 


1 In this connection see also D. Gabor in Progress in Optics , Vol 1. E. Wolf ed. (North-Holland. Amsterdam. 
1961), Section V. p. 146 et seq. 

2 See L. Mandel, J. Opt. Soc. Amer. 51 (1961), 797-798. 
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Fig. 2 The degeneracy parameters of blackbody radiation for various temperatures and 
wavelengths. [Reproduced from W. Martienssen and E. Spiller, Amer. J. Phys. 32 
(1964). 919-926. 


indicating that such light is highly non-degenerate (S 1). In order that <5 ~ 1 at this fre¬ 

quency. it is necessary that the temperature be of the order of 3 X 10 4 K. 

The situation is quite different for laser light. To see this, let us consider a laser with 1 mW 
output power, generating a beam of cross-section 1 mm 2 and having mean wavelength 
A = 6 X 10 'em (u = 5 X 10 l4 Hz). The number of photons per unit volume expressed in 
terms of the energy of a single photon in such a light beam is 

_ photons _ (beam power) 

volume (/tu)((T)(cross-sectional area of beam) 

= _ (1 X 10~ 3 J/s) _ 

(3.3 X 10“ 19 J)(3 X 10 8 m/s)(l X 10” 6 m 2 ) 

= 10 13 photons/m 2 

= 10 7 photons/em 3 . ( 16 ) 

We have seen earlier [Eq. (7) of Section 1.4] that, over a short enough time interval, such a 
laser is sufficiently stable that the coherence volume of the light which it generates is 
AV ~ 300 cm’. Hence, in this case, the degeneracy parameter has the value 

8 = p ■ AV ~ (TO photons/cm 1 ) X 300cm = 3 X 10 y . (17) 

Such light is, therefore, highly degenerate (6 > 1). Comparison of Eqs. (17) and (15) shows 
the remarkable difference of 13 orders of magnitude between the degeneracy of blackbody 
radiation and that of the laser beam. 








Appendix II 


Derivation ofMandel ’s formula for 
photocount statistics 
[Eq. (2) of Section 7.5.1 J 1 


Consider a linearly polarized quasi-monochromatic wave, with fluctuating complex amplitude 
V(t), incident on a photodetector of quantum efficiency a. According to Eq. (1) of Section 
7.5.1, the probability that an electron will then be emitted in a time-interval (t. t + T) is 
given by P(t)At = aI(t)At. 

Let us divide the interval (t, t + T) into T/At short intervals, each of duration At, which 
we will label as 


t t = t + iAt (i = 0, 1,2,..., T/At). (1) 

The probability of obtaining n counts in the interval (/, t + T) is the product of the proba¬ 
bilities of obtaining a count at lime t T{ , a count at time ? r ,.... a count at time t r , multiplied 
by the probability of obtaining no count in the remaining (T/At) — n intervals, summed 
over all possible sequences of the counts. Thus 

T/At T/At T/At , 

pin, t, T) = lim Y. J2 " '72 ~1 )ht , )'" Ct r )(At) n 

T/At 

x n [1 - oddAT] 

/=0 

We note that as At —» 0 the product 


n a - aHt r m. 


( 2 ) 


j =1 


1 [1 - (Xlit )Ar] 1 - nO(At), (3) 


1 The analysis of this appendix follows essentially a derivation given by L. Mandel in Progress in Optics , Vol. II. 
E. Wolfed. (North-Holland. Amsterdam. 1963). pp. 242-248. 
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which evidently tends to unity if n is finite. The multiple summation over the indices 
r i, r 2,. . ., r„ becomes separable and equal to 

\ll 

77 A/ 

E «!(;,) AT , 

r =0 

which tends to 


t + T 

of /(t')dt' 


t 


n 


as At —* 0. The remaining product can be expressed as follows: 


r/i 

n [i - vi{t,)At\ = i - 

j=0 


T/At 

y]n/(r.)At 

i=0 


2 ! 

_1_ 

3! 


TIAi 

E vnt^At 

/=() 

T/At 

E 

f=o 


, r/St 

-■ 1=0 


^EE“ 2 /u ; )/ 2 (9(ao 3 

j t 


( 4 ) 


The terms in the square brackets are all of order zero in At, while the others are of the first 
and higher order in At and become negligible as At —» 0. It follows that 


T/At 


[] [1 - a/(f,)Af] 

1=0 


exp 


exp 


T/At 

-E nl ^t i )At 
»= 0 
f+r 

af /(t)dt' 


(5) 


as At —» 0. Hence Eq. (1) gives Mandel’s formula for photocount statistics, viz.. 


Pin, t, T) = — 


Ml 


/+r 


Q: 


/ /aW 


exp 



(6) 


• . +T' 

Equation (6) is evidently a Poisson distribution in n, with parameter a J I(t')dt'. 




















Appendix III 


The degree of polarization of 
an electromagnetic 
Gaussian Schell-model source 


The cross-spectral density matrix of a Gaussian Schell-model source is, according to 
Eqs. (5) and (6) of Section 9.4, given by the formula 

wf{ P| . p 2 , oi) = fspi Pi . oi) Jsf ( p v w) fif (p 2 - Pl . oi), (i = .v, yj = x. y). 

Cl) 

where the spectral density 

S, ,0) (p; oi) = Afexpl-pOOaf)], (2) 

and the correlation coefficient 

/'f(P 2 “ P,; <o) = B,j exp[— (ps - p,) 2 /(2<5 2 )]» (3) 

with the coefficients fi,y satisfying the constraints given by Eq. (7) of Section 9.4. 

According to Eq. (14) of Section 9.2, the spectral degree of polarization of the electric field 
at a point specified by position vector p in the source plane is given by the general formula 

j*o V w) = L _ 4DetW(p, p^u) (4) 

V [Tr W(p. p, oi)] 2 

When p 2 = Pi = p, Eq. (1) becomes 

W< 0, (p, p, 01 ) = fsflp. 01 ) fsffp, oi) 4 0) (0, 01 ), (i = .V, y; j = x. y). (5) 

It follows that 


Det W (0) (p, p, oi) = W I() 'W (0, - H ,(0, IV (0) 

^ r 7 xv vv xy v.v 

= S«°'(p, oi)STO(p, W )[l - | M <0) (0j wJ |2 ]> (6) 
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where we have used the fact that w ) = //'^*(0, to), the asterisk denoting the complex 

conjugate. 

Further, it follows from Eq. (5) and from the fact that //.„(0, to) = 1 (/ = x, v) that 

TrWtp, p, to) = S^fip, to) + 5j 0, (p. to). (7) 


On substituting from Eqs. (6) and (7) into Eq. (4) we obtain for the degree of polariza¬ 
tion in the plane j = 0 the expression 


!P <0) (p, to) 



4S<°>(p, w)5; 0l (p. oj) 
[5<°»(p, «) + S<°>(p, w)] 2 


[1 


|<’(0. tu)| 2 ]. 


( 8 ) 


On substituting from Eqs. (2) and (3) into this formula we obtain for the degree of polar¬ 
ization the expression 


yorip, oj) 


4A 2 A 2 exp 

1 , 

~ 7 P 

1 1 

a 2 n- 





X V 







^2 

A 2 exp 

_ P" 

2 a 2 

X 

+ A 2 exp 

P 

2cr 2 

V 


(1 - \BJ 2 ). 


(9) 


We note that if 


Eq. (9) reduces to 


4A 2 A 2 


1P |0) (P. to) = 1 -- r - v (1 - 1 B I 2 ) 

V (A 2 + A 2 ) 2 1 


( 10 ) 


01 ) 


This formula implies that when the condition {10) is satisfied, i.e. when the r.nus. widths of 
the spectral densities of the x and the y components of the electric field in the source plane 
z = 0 are the same, the degree of polarization to) is independent of p, i.e. it is the same 

at every source point. If, in addition, \B xy \ = 1 then, according to Eq. (3), |p AV (0, «)| = 1, 
implying that the x and the y components of the electric field are then completely correlated 
at each source point. Equation (11) then gives !P (0, (p, to) = 1, i.e. the degree of polarization 
is now unity at each source point, indicating that the electric field is, in this case, com¬ 
pletely polarized across the source. 




















Appendix IV 


Some important probability 
distributions 


In this appendix we briefly consider some of the most important probability distributions 
and summarize their main properties. 

(a) The binomial (or Bernoulli) distribution and some of its limiting cases 

The binomial distribution, also known as Bernoulli’s distribution, is one of the most impor¬ 
tant probability distributions encountered in classical physics. Some other well-known dis¬ 
tributions such as the Poisson distribution and the Gaussian (or normal) distribution may be 
regarded as limiting cases of it. 

The binomial distribution applies to experiments of the repetitive kind such as, for exam¬ 
ple. a succession of throws of dice. Let P be the probability of a certain event occurring in 
a single trial. Let us determine the probability ppfn, P), (n = 0. 1,2,.. ., AO, of exactly n 
successes occurring in N independent trials. We will denote by S a successful outcome and 
by Fa failure. One possibility of having exactly n successes in A? trials could be an outcome 
indicated by the sequence 

S S S F F F F F F. 

-.--.- ( 1 ) 

n times N—n times 

By the so-called product rule for independent events' the probability of such an outcome is 

P X P X P... X Q X Q X Q... = pnQN-n = p» ( \ _ p)N~n^ 

n times N—n times 

where Q = 1 — P denotes the probability of a failure of the event to occur in a single trial. 

We are actually not interested in the particular sequence (1) but rather in all sequences 
with n P s and (N - n) Q s, i.e. n successes and N—n failures, irrespective of the order of 


1 See, for example. J. F. Kenney and E. S. Keeping. Mathematics of Statistics. Part T\w, second edition (D. Van 
Nostrand. New York, 1951). p. 10. 
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the P s and the Q s. The number of such sequences is evidently the same as the number of 
ways in which one can distribute n particles among N boxes. This number is equal to the 
binomial coefficient 


N C 


n 


N N\ 

n (N — n)ln\ 


(3) 


Using Eqs. (2) and (3) we see that the probability of n successes in N trials is 

PN (n. P) = N C n P n ( 1 - P) N ~” (0 < n «£ N). (4) 

This formula represents the binomial distribution. 

It is not difficult to show that the first two moments of this distribution are 

N 

n = np N (n,P) = NP , (5a) 

n =0 


n 


2 


N 

Y^n-p^n.P) = NP(Q + NP) 


n= 0 


(5b) 


and that the variance 


a 2 = (n - n) 2 = NPQ. (5c) 

It may be shown that in the simultaneous limit of a very large number of trials (N —* ooj 
and a very small probability of success in a single trial (P —> 0), subject to the constraint that 
the mean is still given by Eq. (5a), the binomial distribution given by Eq. (4) can be shown 
to become the Poisson distribution 1 


p(n) = 


n"e " 
n\ 


( 6 ) 


n being, of course, the mean of the distribution. 

The Poisson distribution is found to be a good approximation to the binomial distribu¬ 
tion when simultaneously N > 1 and n <n. 

One can readily verify that the second moment and the variance of the Poisson distribu¬ 
tion are given by the expressions 

n 2 = n + n 2 , (7a) 

cr 2 = n. (7b) 


J. F. Keeney and E. S. Keeping, Mathematics of Statistics, Pan One. third edition (D. van Nostrand. New York. 
(1954), p 153. 
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Another important limiting case of the binomial distribution is the Gaussian distribution 
(also known as the normal distribution) 


p(n) = - e -<»-n) 2 K 2 a 2 > ^ (8) 

<7V27T 

expressed here in terms of its mean n and its variance a 1 . It may be formally derived from 
the binomial distribution as N—»<», under certain constraints. 1 

(b) The Bose-Einstein distribution 

As we noted earlier, the binomial coefficient N C„, defined by Eq. (3), can be understood 
from the analogy with the problem of distributing n particles among N boxes. The particles 
are regarded as physically distinct, i.e. they can be distinguished from each other. 

Towards the end of Section 7.4 we encountered the concept of a cell of phase space, dis¬ 
cussed in Appendix I. In such a domain, photons and some other particles are, as a conse¬ 
quence of Heisenberg’s quantum-mechanical uncertainty principle, indistinguishable from 
each other. It is clear that under these circumstances the binomial distribution will not 
apply. Let us examine this situation. 

The probability that n such particles are located in a cell of phase space is given by Eq. 
(2) as before. However, the binomial coefficient N C„, given by Eq. (3), is no longer appro¬ 
priate because it represents the number of ways in which n distinguishable particles are dis¬ 
tributed. In place of Eq. (4), one then has 

p v (/K P) = AVT'Tl - Pf-'\ (9) 

where the factor K N has to be chosen so that the probability p N is properly normalized. One 
might expect that this factor also depends on n, but more refined analysis shows that, in 
fact, this is not so. 

Equation (9) has to be normalized so that 

^ p N (n, p ) = \. (10) 

n =o 


On substituting from Eq. (9) into Eq. (10) one readily finds that 


K 


N 


1 - a 1 

1 - a N Q N ’ 


(ID 


See, for example, A. Papoulis and S. Unnikrishna Pillai, Probability. Random Variables and Stochastic 
Processes , fourth edition (McGraw-Hill. Boston, 2002), pp. 156— 157, Theorem 5.2. 
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where Q = 1 — P as before and 


P 

a = —. 

Q 

On substituting from Eq. (11) into Eq. (9) we obtain for p N the expression 

1 - a .. 


( 12 ) 


p N (n.P) 


n/~)N ~n — 


l 


viv r>N 


P n Q 


(13) 


Suppose that the number of the particles is very large, i.e. that N 9> and we formally pro¬ 
ceed to the limit as N —> oo. In order that the expression (13) tends to a finite limit we must 
evidently have a < 1. The formula (13) then becomes 

p(n) = (1 - a)a", (14) 

where we have now written pin) in place of P). The formula (14) is one form of the 
so-called Bose—Einstein distribution for one cell of phase space. 

Straightforward calculations give for the mean, for the second moment and for the vari¬ 
ance of this distribution the expressions 

X 

n = Y np(n) = , (15a) 

«=0 1 ° 

_ X 

n- = n 2 p(n) = h, (15b) 

n =0 


<j 2 = (n — n) z = n + n 2 . (15c) 


Using the formula (15a) one may express the Eq. (14) in the form 


P(n) = 


h" 

(n + 1)' ,+1 ’ 


(16) 


which is a more familiar form of the Bose-Einstein distribution for one cell of phase space. 1 


The Bose-Einstein distribution for several cells of phase space is given, for example, in L. Mandel, Proc. Phys. 
Soc. (London). 74 (1959), 233-243. 
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All optical fields undergo random fluctuations. They 
may be small, as in the output of many lasers, or they 
may be appreciably larger, as in light generated by 
thermal sources. The underlying theory of fluctuating 
optical fields is known as coherence theory. An 
important manifestation of the fluctuations is the 
phenomenon of partial polarization. Actually, coherence 
theory deals with considerably more than fluctuations. 

Unlike usual treatments, it describes optical fields in terms of observable quantities 
and elucidates how such quantities, tor example, the spectrum of light, change as 
light propagates. 

This book is the first to provide a unified treatment of the phenomena of coherence 
and polarization. The unification has been made possible by very recent 
discoveries, largely due to the author of this book. 

The subjects treated in this volume are of considerable importance for graduate 
students and for research workers in physics and in engineering, who are 
concerned with optical communications, with propagation of laser beams through 
fibers and through the turbulent atmosphere, with optical image formation, 
particularly in microscopes, and with medical diagnostics, for example. Each 
chapter contains problems to aid self-study. 
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